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REFERENCE TO RELATED APPLICATIONS 

This application claims priority to U.S. provisional patent applications 60/296,076 
5 filed 6/5/2001 , 60/328,605 filed 10/10/2001 , and 60/357,253 filed 2/15/2002. The 
contents of the prior applications are hereby incorporated in their entirety. 



BACKGROUND OF THE INVENTION 

The p53 gene is mutated in over 50 different types of human cancers, including 
10 familial and spontaneous cancers, and is believed to be the most commonly mutated gene 
in human cancer (Zambetti and Levine, FASEB (1993) 7:855-865; HoUstein, et aU 
Nucleic Acids Res. (1994) 22:3551-3555). Greater than 90% of mutations in the p53 gene 
are missense mutations that alter a single amino acid that inactivates p53 function. 
Aberrant forms of human p53 are associated with poor prognosis, more aggressive tumors, 
15 metastasis, and short survival rates (Mitsudomi et ai., Clin Cancer Res 2000 Oct; 
6(10):4055~63; Koshland, Science (1993) 262:1953). 

The human p53 protein normally functions as a central integrator of signals including 
DNA damage, hypoxia, nucleotide deprivation, and oncogene activation (Prives, Cell 
(1998) 95:5-8). In response to these signals, p53 protein levels are greatly increased with 
20 the result that the accumulated p53 activates cell cycle arrest or apoptosis depending on 
the nature and strength of these signals. Indeed, multiple lines of experimental evidence 
have pointed to a key role for p53 as a tumor suppressor (Levine, Cell (1997) 88:323-331). 
For example, homozygous p53 "knockout" mice are developmentally normal but exhibit 
nearly 100% incidence of neoplasia in the first year of life (Donehower et ah, Nature 
25 (1992) 356:215-221). 

The biochemical mechanisms and pathways through which p53 functions in normal 
and cancerous cells are not fully understood, but one clearly important aspect of p53 
function is its activity as a gene-specific transcriptional activator. Among the genes with 
known p53-response elements are several with well-characterized roles in either regulation 
30 of the cell cycle or apoptosis, including GADD45, p21/Waf 1/Cipl, cyclin G, Bax, IGF- 
BP3, andMDM2 (Levine, Cell (1997) 88:323-331). 

Microtubules have a central role in the regulation of cell shape and polarity during 
differentiation, chromosome partitioning at mitosis, and intracellular transport. 
Microtubules undergo rearrangements involving rapid transitions between stable and 
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dynamic states during these processes. Microtubule affinity regulating kinases (MARK) 
are a novel family of protein kinases that phosphorylate microtubule-associated proteins 
and trigger microtubule disruption (Drewes, G., et at. (1997) Cell 89: 297-308). 

Microtubule affinity regulating kinase 1 (MARK1) is a serine/threonine Mnase that 
5 phosphorylates microtubule-associated protein tau, leading to disruption of microtubules. 
It shares 90% amino acid homology with the rat version of MARK1, and demonstrates 
ubiquitous expression with highest levels in testis and brain (Nagase, T. et al. (2000) DNA 
Res. 7: 143-150). 

EMK1 (MARK2) is a serine/threonine protein kinase with two isoforms, which differ 
10 by the presence or absence of a 162-bp alternative exon (Espinosa, L. and Navarro, E. 

(1998) Cytogenet. Cell Genet. 81:278-282). Both human isoforms are coexpressed in a 
number of cell lines and tissues, with the highest expression found in heart, brain, 
placenta, skeletal muscle, and pancreas, and at lower levels in lung, liver, and kidney 
(Inglis, J. et al. (1993) Mammalian Genome 4: 401-403). Due to the physical location of 

15 this gene, 1 Iql2-ql3, EMK1 is a candidate gene for carcinogenic events (Courseaux, A. et 
al. (1995) Mammalian Genome 6: 311-312), and has been associated with colon and 
prostate cancer (Moore, T. M., et al. (2000) J Biol Chem 275:4311-22; Navarro, R, et al. 

(1999) Biochim Biophys Acta 1450: 254-64). 

Microtubule affinity regulating kinase 3 (MARK3) was originally identified as a 
20 marker (KP78) induced by treatment with DNA damaging agents. The loss of MARK3 

was associated with carcinogenesis in the pancreas (Parsa, I. (1988) Cell Growth Differ. 9: 
197-208). MARK3 may be involved in cell cycle regulation, and alterations in the 
MARK3 gene may lead to carcinogenesis. MARK 3 is ubiquitously expressed throughout 
human tissues, with an additional 3.0 Kb transcript present in the heart (Peng, C. et al. 
25 (1998 ) Cell Growth Differ. 9: 197-208). 

MAP/microtubule affinity-regulating kinase like 1 (MARKL1) has two isoforms 
(Nagase, T. et al. (2001) DNA Res. 8: 85-95), is activated by the beta-catenin/Tcf complex 
in hepatic cell lines, and may be involved in hepatic carcinogenesis (Kato, T. et al. (2001). 
Neoplasia 3:4-9). 

30 The ability to manipulate the genomes of model organisms such as Drosophila 

provides a powerful means to analyze biochemical processes that, due to significant 
evolutionary conservation, has direct relevance to more complex vertebrate organisms. 
Due to a high level of gene and pathway conservation, the strong similarity of cellular 
processes, and the functional conservation of genes between these model organisms and 
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mammals, identification of the involvement of novel genes in particular pathways and 
their functions in such model organisms can directly contribute to the understanding of the 
correlative pathways and methods of modulating them in mammals (see, for example, 
Mechler BM et al., 1985 EMBO J 4:1551-1557; Gateff E. 1982 Adv. Cancer Res. 37: 33- 
5 74; Watson KL., et al., 1994 J Cell Sci. 18: 19-33; Miklos GL, and Rubin GM. 1996 Cell 
86:521-529; Wassarman DA, et al., 1995 Curr Opin Gen Dev 5: 44-50; and Booth DR. 
1999 Cancer Metastasis Rev. 18: 261-284). For example, a genetic screen can be carried 
out in an invertebrate model organism having underexpression (e.g. knockout) or 
overexpression of a gene (referred to as a "genetic entry point") that yields a visible 
10 phenotype. Additional genes are mutated in a random or targeted manner. When a gene 
mutation changes the original phenotype caused by the mutation in the genetic entry point, 
the gene is identified as a "modifier" involved in the same or overlapping pathway as the 
genetic entry point. When the genetic entry point is an ortholog of a human gene 
implicated in a disease pathway, such as p53, modifier genes can be identified that may be 
15 attractive candidate targets for novel therapeutics. 

All references cited herein, including sequence information in referenced Genbank 
identifier numbers and website references, are incorporated herein in their entireties. 

SUMMARY OF THE INVENTION 

20 We have discovered genes that modify the p53 pathway in Drosophila, and identified 

their human orthologs, hereinafter referred to as MARK. The invention provides methods 
for utilizing these p53 modifier genes and polypeptides to identify candidate therapeutic 
agents that can be used in the treatment of disorders associated with defective p53 
function. Preferred MARK-modulating agents specifically bind to MARK polypeptides 

25 and restore p53 function. Other preferred MARK-modulating agents are nucleic acid 

modulators such as antisense oligomers andRNAi that repress MARK gene expression or 
product activity by, for example, binding to and inhibiting the respective nucleic acid (i.e. 
DNAormRNA). 

MARK-specific modulating agents may be evaluated by any convenient in vitro or in 
30 vivo assay for molecular interaction with a MARK polypeptide or nucleic acid. In one 

embodiment, candidate p53 modulating agents are tested with an assay system comprising 
a MARK polypeptide or nucleic acid. Candidate agents that produce a change in the 
activity of the assay system relative to controls are identified as candidate p53 modulating 
agents. The assay system may be cell-based or cell-free. MARK-modulating agents 
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include MARK related proteins (e.g. dominant negative mutants, and biotherapeutics); 
MARK-specific antibodies; MARK-specific antisense oligomers and other nucleic acid 
modulators; and chemical agents that specifically bind MARK or compete with MARK 
binding target. In one specific embodiment, a small molecule modulator is identified 
5 using a kinase assay. In specific embodiments, the screening assay system is selected 
from a binding assay, an apoptosis assay, a cell proliferation assay, an angiogenesis assay, 
and a hypoxic induction assay. 

In another embodiment, candidate p53 pathway modulating agents are further tested 
using a second assay system that detects changes in the p53 pathway,- such as angiogenic, 
10 apoptotic, or cell proliferation changes produced by the originally identified candidate 
agent or an agent derived from the original agent. The second assay system may use 
cultured cells or non-human animals. In specific embodiments, the secondary assay 
system uses non-human animals, including animals predetermined to have a disease or 
disorder implicating thep53 pathway, such as an angiogenic, apoptotic, or cell 
15 proliferation disorder (e.g. cancer). 

The invention further provides methods for modulating the p53 pathway in a 
mammalian cell by contacting the mammalian cell with an agent that specifically binds a 
MARK polypeptide or nucleic acid. The agent may be a small molecule modulator, a 
nucleic acid modulator, or an antibody and may be administered to a mammalian animal 
20 predetermined to have a pathology associated the p53 pathway. 

DETAILED DESCRIPTION OF THE INVENTION 

Genetic screens were designed to identify modifiers of the p53 pathway in Drosophila 
in which p53 was overexpressed in the wing (Ollmann M, et al., Cell 2000 101: 91-101). 
25 The KP78a gene was identified as a modifier of the p53 pathway. Accordingly, vertebrate 
orthologs of these modifiers, and preferably the human orthologs, microtubule affinity 
regulator kinase (MARK) genes (i.e., nucleic acids and polypeptides) are attractive drug 
targets for the treatment of pathologies associated with a defective p53 signaling pathway, 
such as cancer. 

30 hi vitro and in vivo methods of assessing MARK function are provided herein. 

Modulation of the MARK or their respective binding partners is useful for understanding 
the association of the p53 pathway and its members in normal and disease conditions and 
for developing diagnostics and therapeutic modalities for p53 related pathologies. 
MARK-modulating agents mat act by inhibiting or enhancing MARK expression, directly 
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or indirectly, for example, by affecting a MARK, function such as enzymatic (e.g., 
catalytic) or binding activity, can be identified using methods provided herein. MARK 
modulating agents are useful in diagnosis, therapy and pharmaceutical development. 

5 Nucleic acids and poly peptides of the invention 

Sequences related to MARK nucleic acids and polypeptides that can be used in the 
invention are disclosed in Geribank (referenced by Genbank identifier (Gl) number) as 
GI#s 9845486 (SEQ ID NO: 1), 9845488 (SEQ ID NO:2), 18578044 (SEQ ID NO:3), 
14250621 (SEQ ID NO:6), 15042610 (SEQ ID NO:7), 8923921 (SEQ ID NO:8), 
10 17445805 (SEQ ID NO:9), 7959214 (SEQ ID NO: 1 1), 14042208 (SEQ ID NO: 12), . 
3089348 (SEQ ID NO:13), 4505102 (SEQ ID NO:14), 5714635 (SEQ ID NO:15), 
18448970 (SEQ ID NO:18), 13366083 (SEQ ID NO:19), 14017936 (SEQ ID NO:22), and 
16555377 (SEQ ID NO:23) for nucleic acid, and GI#s 9845487 (SEQ ID NO:24), 
8923922 (SEQ ID NO:25), 3089349 (SEQ ID NO:26), 4505103 (SEQ ID NO:27), 
15 13366084 (SEQ ID NO:28) and 13899225 (SEQ ID NO:29) for polypeptides. 

Additionally, nucleic acid sequences of SEQ ID NOs:4, 5, 16, 17, 20, 21, and novel 
nucleic acid sequence of SEQ ID NO: 10 can also be used in the invention. 

MARKs are kinase proteins with kinase and UBA/TS-N domains. The term "MARK 
polypeptide" refers to a full-length MARK protein or a functionally active fragment or 
20 derivative thereof. A "functionally active" MARK fragment or derivative exhibits one or 
more functional activities associated with a full-length, wild-type MARK protein, such as 
antigenic or immunogenic activity, enzymatic activity, ability to bind natural cellular 
substrates, etc. The functional activity of MARK proteins, derivatives and fragments can 
be assayed by various methods known to one skilled in the art (Current Protocols in 
25 Protein Science (1998) Coligan et ah, eds., John Wiley & Sons, Inc., Somerset, New 

Jersey) and as further discussed below. For purposes herein, functionally active fragments 
also include those fragments mat comprise one or more structural domains of a MARK 
such as a kinase domain or a binding domain. Protein domains can be identified using the 
PFAM program (Bateman A., et al., Nucleic Acids Res, 1999, 27:260-2; 
30 http:// pfaTTi.wustl.edu) . For example, the proten kinase domains of MARKs from GI#s 
9845487 (SEQ ID NO:24), 8923922 (SEQ ID NO:25), 4505103 (SEQ ID NO:27), and 
13899225 (SEQ ID NO:29) is located at approximately amino acid residues 20 to 271, 60 
to 31 1, 56 to 307, and 59 to 3 10, respectively (PFAM 00069). Further, the ubiquitin 
associated (UBA/TS-N) domains of MARKs from GI#s 9845487 (SEQ ID NO:24), 
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8923922 (SEQ ID NO:25), 4505103 (SEQ ID NO:27and 13899225 (SEQ ID NO:29) is 
located at approximately amino acid residues 291 to 330, 331 to 370, 327 to 366, and 330 
to 369, respectively (PFAM 00627). Methods for obtaining MARK polypeptides are also 
further described below. In some embodiments, preferred fragments are functionally 
5 active, domain-containing fragments comprising at least 25 contiguous amino acids, 
preferably at least 50, more preferably 75, and most preferably at least 100 contiguous 
amino acids of any one of SEQ ED NOs:24, 25,26,27, 28, or 29 (a MARK). In further 
preferred embodiments, the fragment comprises the entire kinase (functionally active) 
domain. 

10 The term "MARK nucleic acid" refers to a DNA or RNA molecule that encodes a 

MARK polypeptide. Preferably, the MARK polypeptide or nucleic acid or fragment 
thereof is from a human, but can also be an ortholog, or derivative thereof with at least 
70% sequence identity, preferably at least 80%, more preferably 85%, still more 
preferably 90%, and most preferably at least 95% sequence identity with MARK. 
15 Normally, orthologs in different species retain the same function, due to presence of one 
or more protein motifs and/or 3-dimensional structures. Orthologs are generally identified 
by sequence homology analysis, such as BLAST analysis, usually using protein bait 
sequences. Sequences are assigned as a potential ortholog if the best hit sequence from 
the forward BLAST result retrieves the original query sequence in the reverse BLAST 
20 (Huynen MA and Bork P, Proc Natl Acad Sci (1998) 95:5849-5856; Huynen MA et al, 

Genome Research (2000) 10:1204-1210). Programs for multiple sequence alignment, such 
as CLUSTAL (Thompson JD et al, 1994, Nucleic Acids Res 22:4673-4680) may be used 
to highlight conserved regions and/or residues of orthologous proteins and to generate 
phylogenetic trees. In a phylogenetic tree representing multiple homologous sequences 
25 from diverse species (e.g., retrieved through BLAST analysis), orthologous sequences 
from two species generally appear closest on the tree with respect to all other sequences 
from these two species. Structural threading or other analysis of protein folding (e.g., 
using software by ProCeryon, Biosciences, Salzburg, Austria) may also identify potential 
orthologs. In evolution, when a gene duplication event follows speciation, a single gene in 
30 one species, such as Drosophila, may correspond to multiple genes (paralogs) in another, 
such as human. As used herein, the term "orthologs" encompasses paralogs. As used 
herein, "percent (%) sequence identity" with respect to a subject sequence, or a specified 
portion of a subject sequence, is defined as the percentage of nucleotides or amino acids in 
the candidate derivative sequence identical with the nucleotides or amino acids in the 
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subject sequence (or specified portion thereof), after aligning the sequences and 
introducing gaps, if necessary to achieve the maximum percent sequence identity, as 
generated by the program WU~BLAST-2.0al9 (Altschul et aZ., J. Mol. Biol. (1997) 
215:403-410; http;//blast. wusti.eduMastZElEADME.html) with all the search parameters 
5 set to default values. The HSP S and HSP S2 parameters are dynamic values and are 
established by the program itself depending upon the composition of the particular 
sequence and composition of the particular database against which the sequence of interest 
is being searched. A % identity value is determined by the number of matching identical 
nucleotides or amino acids divided by the sequence length for which the percent identity is 
10 being reported. "Percent (%) amino acid sequence similarity" is determined by doing the 
same calculation as for determining % amino acid sequence identity, but including 
conservative amino acid substitutions in addition to identical amino acids in the 
computation. 

A conservative amino acid substitution is one in which an amino acid is substituted for 
15 another amino acid having similar properties such that the folding or activity of the protein 
is not significantly affected. Aromatic amino acids that can be substituted for each other 
are phenylalanine, tryptophan, and tyrosine; interchangeable hydrophobic amino acids are 
leucine, isoleucine, methionine, and valine; interchangeable polar amino acids are 
glutamine and asparagine; interchangeable basic amino acids are arginine, lysine and 
20 histidine; interchangeable acidic amino acids are aspartic acid and glutamic acid; and 
interchangeable small amino acids are alanine, serine, threonine, cysteine and glycine. 

Alternatively, an alignment for nucleic acid sequences is provided by the local 
homology algorithm of Smith and Waterman (Smith and Waterman, 1981, Advances in 
Applied Mathematics 2:482-489; database: European Bioinformatics Institute 
25 http://www.ebi.ac.uk/MPsrch/; Smith and Waterman, 1981, J. of Molec.Biol., 147: 195- 
197; Nicholas et al., 1998, "A Tutorial on Searching Sequence Databases and Sequence 
Scoring Methods" (www.psc.edu) and references cited therein.; W.R. Pearson, 1991, 
Genomics 11:635-650). This algorithm can be applied to amino acid sequences by using 
the scoring matrix developed by Dayhoff (Dayhoff : Adas of Protein Sequences and 
30 Structure, M. O. Dayhoff ed., 5 suppl. 3:353-358, National Biomedical Research 

Foundation, Washington, D.C., USA), and normalized by Gribskov (Gribskov 1986 
Nucl. Acids Res. 14(6):6745-6763). The Smith-Waterman algorithm may be employed 
where default parameters are used for scoring (for example, gap open penalty of 12, gap 
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extension penalty of two). From the data generated, trie "Match" value reflects "sequence 
identity." 

Derivative nucleic acid molecules of the subject nucleic acid molecules include 
sequences that hybridize to the nucleic acid sequence of any of SEQ ID NOs:l through 23. 

5 The stringency of hybridization can be controlled by temperature, ionic strength, pH, and 
the presence of denaturing agents such as formamide during hybridization and washing. 
Conditions routinely used are set out in readily available procedure texts (e.g., Current 
Protocol in Molecular Biology, Vol. 1, Chap. 2.10, John "Wiley & Sons, Publishers (1994); 
Sambrook et al, Molecular Cloning, Cold Spring Harbor (1989)). In some embodiments, 
10 a nucleic acid molecule of the invention is capable of hybridizing to a nucleic acid 

molecule containing the nucleotide sequence of any one of SEQ ID NOs:l through 23 
under stringent hybridization conditions that comprise: prehybridization of filters 
containing nucleic acid for 8 hours to overnight at 65° C in a solution comprising 6X 
single strength citrate (SSC) (IX SSC is 0.15 M NaCl, 0.015 M Na citrate; pH 7.0), 5X 

15 Denhardt's solution, 0.05% sodium pyrophosphate and 100 jigfwl herring sperm DNA; 
hybridization for 18-20 hours at 65° C in a solution containing 6X SSC, IX Denhardt's 
solution, 100 /Ag/ml yeast tRNA and 0.05% sodium pyrophosphate; and washing of filters 
at 65° C for lh in a solution containing 0.2X SSC and 0.1% SDS (sodium dodecyl 
sulfate). 

20 In other embodiments, moderately stringent hybridization conditions are used that 

comprise: pretreatment of filters containing nucleic acid for 6 h at 40° C in a solution 
containing 35% formamide, 5X SSC, 50 mM Tris-HCl (pH7.5), 5mM EDTA, 0.1% PVP, 
0.1% Ficoll, 1% BSA, and 500 /ig/ml denatured salmon sperm DNA; hybridization for 
18-20h at 40° C in a solution containing 35% formamide, 5X SSC, 50 mM Tris-HCl 
25 (pH7.5), 5mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 /ig/ml salmon sperm 

DNA, and 10% (wt/vol) dextran sulfate; followed by washing twice for 1 hour at 55° C in 
a solution containing 2X SSC and 0.1% SDS. 

Alternatively, low stringency conditions can be used that comprise: incubation for 8 
hours to overnight at 37° C in a solution comprising 20% formamide, 5 x SSC, 50 mM 
30 sodium phosphate (pH 7.6), 5X Denhardt' s solution, 10% dextran sulfate, and 20 ju.g/ml 
denatured sheared salmon sperm DNA; hybridization in the same buffer for 18 to 20 
hours; and washing of filters in 1 x SSC at about 37° C for 1 hour. 
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gggjaflgn v^OnHUm. Krares^ 9 nd Mis-expression of MARK Nucleic Acids and 
Polypeptides 

MARK nucleic acids and polypeptides, useful for identifying and testing agents that 
modulate MARK function and for other applications related to the involvement of MARK 
5 in the p53 pathway. MARK nucleic acids and derivatives and orlhologs thereof may be 
obtained using any available metibod. For instance, techniques for isolating cDNA or 
genomic DNA sequences of interest by screening DNA libraries or by using polymerase 
chain reaction (PGR) are well known in the art. In general, the particular use for the 
protein will dictate the particulars of expression, production, and purification methods. 
10 For instance, production of proteins for use in screening for modulating agents may 
require methods that preserve specific biological activities of these proteins, whereas 
production of proteins for antibody generation may require structural integrity of particular 
epitopes. Expression of proteins to be purified for screening or antibody production may 
require the addition of specific tags {e.g., generation of fusion proteins). Overexpression 
15 of a MARK protein for assays used to assess MARK function, such as involvement in cell 
cycle regulation or hypoxic response, may require expression in eukaryotic cell lines 
capable of these cellular activities. Techniques for the expression, production, and 
purification of proteins are well known in the art; any suitable means therefore may be 
used (e.g., Higgins SJ andHames BD (eds.) Protein Expression: A Practical Approach, 
20 Oxford University Press Inc., New York 1999; Stanbury PF et al., Principles of 

Fermentation Technology, 2 nd edition, Elsevier Science, New York, 1995; Doonan S (ed.) 
Protein Purification Protocols, Humana Press, New Jersey, 1996; Coligan JE et al, Current 
Protocols in Protein Science (eds.), 1999, John Wiley & Sons, New York). In particular 
embodiments, recombinant MARK is expressed in a cell line known to have defective p53 
25 function (e.g. SAOS-2 osteoblasts, H1299 lung cancer cells, C33A and HT3 cervical 
cancer cells, HT-29 and DLD-1 colon cancer cells, among others, available from 
American Type Culture Collection (ATCC), Manassas, VA). The recombinant cells are 
used in cell-based screening assay systems of the invention, as described further below. 
The nucleotide sequence encoding a MARK polypeptide can be inserted into any 
30 appropriate expression vector. The necessary transcriptional and translational signals, 
including promoter/enhancer element, can derive from the native MARK gene and/or its 
flanking regions or can be heterologous. A variety of host-vector expression systems may 
be utilized, such as mammalian cell systems infected with virus (e.g. vaccinia virus, 
adenovirus, etc.); insect cell systems infected with virus (e.g. baculovirus); 
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microorganisms such as yeast containing yeast vectors, or bacteria transformed with 
bacteriophage, plasmid, or cosmid DNA. A host cell strain that modulates the expression 
of, modifies, and/or specifically processes the gene product may be used. 

To detect expression of the MARK gene product, the expression vector can comprise a 
5 promoter operably linked to a MARK gene nucleic acid, one or more origins of 

replication, and, one or more selectable markers (e.g. thymidine kinase activity, resistance 
to antibiotics, etc.). Alternatively, recombinant expression vectors can be identified by 
assaying for the expression of the MARK gene product based on the physical or functional 
properties of the MARK protein in in vitro assay systems (e.g. immunoassays). 
10 The MARK protein, fragment, or derivative may be optionally expressed as a fusion, 

or chimeric protein product (i.e. it is joined via a peptide bond to a heterologous protein 
sequence of a different protein), for example to facilitate purification or detection. A 
chimeric product can be made by ligating the appropriate nucleic acid sequences encoding 
the desired amino acid sequences to each other using standard methods and expressing the 
15 chimeric product. A chimeric product may also be made by protein synthetic techniques, 
e.g- by use of a peptide synthesizer (Hunkapiller et ah, Nature (1984) 310:105-111). 

Once a recombinant cell that expresses the MARK gene sequence is identified, the 
gene product can be isolated and purified using standard methods (e.g. ion exchange, 
affinity, and gel exclusion chromatography; centrifugation; differential solubility; 
20 electrophoresis, cite purification reference). Alternatively, native MARK proteins can be 
purified from natural sources, by standard methods (e.g. immunoaffinity purification). 
Once a protein is obtained, it may be quantified and its activity measured by appropriate 
methods, such as immunoassay, bioassay, or other measurements of physical properties, 
such as crystallography. 

25 The methods of this invention may also use cells that have been engineered for altered 

expression (mis-expression) of MARK or other genes associated with the p53 pathway. 
As used herein, mis-expression encompasses ectopic expression, over-expression, under- 
expression, and non-expression (e.g. by gene knock-out or blocking expression that would 
otherwise normally occur). 

30 

Genetically modified animals 

Animal models that have been genetically modified to alter MARK expression may be 
used in in vivo assays to test for activity of a candidate p53 modulating agent, or to further 
assess the role of MARK in a p53 pathway process such as apoptosis or cell proliferation. 



10 



WO 02/098890 PCT/US02/17461 

Preferably, the altered MARK expression results in a detectable phenotype, such as 
decreased or increased levels of cell proliferation, angiogenesis, or apoptosis compared to 
control animals having normal MARK expression. The genetically modified animal may 
additionally have altered p53 expression (e.g. p53 knockout). Preferred genetically 
5 modified animals are mammals such as primates, rodents (preferably mice), cows, horses, 
goats, sheep, pigs, dogs and cats. Preferred non-mammalian species include zebrafish, C. 
elegans, and Drosophila. Preferred genetically modified animals are transgenic animals 
having a heterologous nucleic acid sequence present as an extrachromosomal element in a 
portion of its cells, i.e. mosaic animals (see, for example, techniques described by 
10 Jakobovits, 1994, Curr. Biol. 4:761-763.) or stably integrated into its germ line DNA (i.e., 
in the genomic sequence of most or all of its cells). Heterologous nucleic acid is 
introduced into the germ hue of such transgenic animals by genetic manipulation of, for 
example, embryos or embryonic stem cells of the host animal. 

Methods of making transgenic animals are well-known in the art (for transgenic mice 
15 see Brinster et al., Proc. Nat. Acad. Sci. USA 82: 4438-4442 (1985), U.S. Pat. Nos. 

4,736,866 and 4,870,009, both by Leder et al., U.S. Pat. No. 4,873,191 by Wagner et al., 
and Hogan, B., Manipulating the Mouse Embryo, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, N.Y., (1986); for particle bombardment see U.S. Pat. No., 4,945,050, 
by Sandford et al.; for transgenic Drosophila see Rubin and Spradling, Science (1982) 
20 218:348-53 and U.S. Pat. No. 4,670,388; for transgenic insects see Berghammer A. J. et 
al, A Universal Marker for Transgenic Insects (1999) Nature 402:370-371; for transgenic 
Zebrafish see Lin S., Transgenic Zebrafish, Methods Mol Biol. (2000);136:375-3830); for 
microinjection procedures for fish, amphibian eggs and birds see Houdebine and 
Chourrout, Experientia (1991) 47:897-905; for transgenic rats see Hammer et al., Cell 
25 (1990) 63 : 1099-1 1 12; and for culturing of embryonic stem (ES) cells and the subsequent 
production of transgenic animals by the introduction of DNA into ES cells using methods 
such as electroporation, calcium phosphate/DNA precipitation and direct injection see, 
e.g., Teratocarcinomas and Embryonic Stem Cells, A Practical Approach, E. J. Robertson, 
ed., IRL Press (1987)). Clones of the nonhuman transgenic animals can be produced 
30 according to available methods (see Wilmut, I. et al. (1997) Nature 385:810-813; and PCT 
International Publication Nos. WO 97/07668 and WO 97/07669). 

In one embodiment, the transgenic animal is a "knock-out" animal having a 
heterozygous or homozygous alteration in the sequence of an endogenous MARK gene 
that results in a decrease of MARK function, preferably such that MARK expression is 
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undetectable or insignificant. Knock-out animals are typically generated by homologous 
recombination with a vector comprising a transgene having at least a portion of the gene to 
be knocked out. Typically a deletion, addition or substitution has been introduced into the 
transgene to functionally disrupt it. The transgene can be a human gene (e.g., from a 

5 human genomic clone) but more preferably is an ortholog of the human gene derived from 
the transgenic host species. For example, a mouse MARK gene is used to construct a 
homologous recombination vector suitable for altering an endogenous MARK gene in the 
mouse genome. Detailed methodologies for homologous recombination in mice are 
available (see Capecchi, Science (1989) 244:1288-1292; Joyner et aL y Nature (1989) 

10 338:153-156). Procedures for the production of non-rodent transgenic mammals and other 
animals are also available (Houdebine and Chourrout, supra\ Pursel et aL, Science (1989) 
244:1281-1288; Simms et al, Bio/Technology (1988) 6:179-183). la a preferred 
embodiment, knock-out animals, such as mice harboring a knockout of a specific gene, 
may be used to produce antibodies against the human counterpart of the gene that has been 

15 knocked out (Claesson MH et al., (1994) Scan J Immunol 40:257-264; DeclerekPJet 
aL, (1995) J Biol Chem. 270:8397-400). 

In another embodiment, the transgenic animal is a "knock-in" animal having an 
alteration in its genome that results in altered expression (e.g., increased (including 
ectopic) or decreased expression) of the MARK gene, e.g., by introduction of additional 

20 copies of MARK, or by operatively inserting a regulatory sequence that provides for 

altered expression of an endogenous copy of the MARK gene. Such regulatory sequences 
include inducible, tissue-specific, and constitutive promoters and enhancer elements. The 
knock-in can be homozygous or heterozygous. 

Transgenic nonhuman animals can also be produced that contain selected systems 

25 allowing for regulated expression of the transgene. One example of such a system that 
may be produced is the cre/loxP recombinase system of bacteriophage PI (Lakso et ah, 
PNAS (1992) 89:6232-6236; U.S. Pat. No. 4,959,317). If a cre/loxP recombinase system 
is used to regulate expression of the transgene, animals containing transgenes encoding 
both the Cre recombinase and a selected protein are required. Such animals can be 

30 provided through the construction of "double" transgenic animals, e.g., by mating two 
transgenic animals, one containing a transgene encoding a selected protein and the other 
containing a transgene encoding a recombinase. Another example of a recombinase 
system is the ELP recombinase system of Saccharomyces cerevisiae (O f Gorman et al. 
(1991) Science 251:1351-1355; U.S. Pat. No. 5,654,182). In a preferred embodiment, 

12 
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both Cre-LoxP andFlp-Frt are used in the same system to regulate expression of the 
transgene, and for sequential deletion of vector sequences in the same cell (Sun X et al 
(2000) Nat Genet 25:83-6). 

The genetically modified animals can be used in genetic studies to further elucidate the 
5 p53 pathway, as animal models of disease and disorders implicating defective p53 

function, and for in vivo testing of candidate therapeutic agents, such as those identified in 
screens described below. The candidate therapeutic agents are administered to a 
genetically modified animal having altered MARK function and phenotypic changes are 
compared with appropriate control animals such as genetically modified animals that 
10 receive placebo treatment, and/or animals with unaltered MARK expression that receive 
candidate therapeutic agent. 

In addition to the above-described genetically modified animals having altered MARK 
function, animal models having defective p53 function (and otherwise normal MARK 
function), can be used in the methods of the present invention. For example, a p53 
15 knockout mouse can be used to assess, in vivo, the activity of a candidate p53 modulating 
agent identified in one of the in vitro assays described below. p53 knockout mice are 
described in the literature (Jacks et al., Nature 2001;410:1111-1116, 1043-1044; 
Donehower et al, supra). Preferably, the candidate p53 modulating agent when 
administered to a model system with cells defective in p53 function, produces a detectable 
20 phenotypic change in the model system indicating that the p53 function is restored, i.e., 
the cells exhibit normal cell cycle progression. 

Modulating Agents 

The invention provides methods to identify agents that interact with and/or modulate 
25 the function of MARK and/or the p53 pathway. Such agents are useful in a variety of 
diagnostic and therapeutic applications associated with the p53 pathway, as well as in 
further analysis of the MARK protein and its contribution to the p53 pathway. 
Accordingly, the invention also provides methods for modulating the p53 pathway 
comprising the step of specifically modulating MARK activity by administering a MARK- 
30 interacting or -modulating agent. 

In a preferred embodiment, MARK-modulating agents inhibit or enhance MARK 
activity or otherwise affect normal MARK function, including transcription, protein 
expression, protein localization, and cellular or extra-cellular activity. In a further 
preferred embodiment, the candidate p53 pathway- modulating agent specifically 
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modulates the function of the MARK. The phrases "specific modulating agent", 
"specifically modulates", etc., axe used herein to refer to modulating agents that directly 
bind to the MARK polypeptide or nucleic acid, and preferably inhibit, enhance, or 
otherwise alter, the function of the MARK. The term also encompasses modulating agents 
5 that alter the interaction of the MARK with a binding partner or substrate (e.g. by binding 
to a binding partner of a MARK, or to a protein/binding partner complex, and inhibiting 
function). 

Preferred MARK-modulating agents include small molecule compounds; MARK- 
interacting proteins, including antibodies and other biotherapeutics; and nucleic acid 

10 modulators such as antisense and RNA inhibitors. The modulating agents may be 
formulated in pharmaceutical compositions, for example, as compositions that may 
comprise other active ingredients, as in combination therapy, and/or suitable carriers or 
excipients. Techniques for formulation and administration of the compounds may be 
found in "Remington's Pharmaceutical Sciences" Mack Publishing Co., Easton, PA, 19 th 

15 edition. 



Small molecule modulators 

Small molecules, are often preferred to modulate function of proteins with enzymatic 
function, and/or containing protein interaction domains. Chemical agents, referred to in 
20 the art as "small molecule" compounds are typically organic, non-peptide molecules, 
having a molecular weight less than 10,000, preferably less than 5,000, more preferably 
less than 1,000, and most preferably less than 500. This class of modulators includes 
chemically synthesized molecules, for instance, compounds from combinatorial chemical 
libraries. Synthetic compounds may be rationally designed or identified based on known 
25 or inferred properties of the MARK protein or may be identified by screening compound 
libraries. Alternative appropriate modulators of this class are natural products, particularly 
secondary metabolites from organisms such as plants or fungi, which can also be 
identified by screening compound libraries for MARK-modulating activity. Methods for 
generating and obtaining compounds are well known in the art (Schreiber SL, Science 
30 (2000) 151: 1964-1969; Radmann J and Gunther J, Science (2000) 151:1947-1948). 

Small molecule modulators identified from screening assays, as described below, can 
be used as lead compounds from which candidate clinical compounds may be designed, 
optimized, and synthesized. Such clinical compounds may have utility in treating 
pathologies associated with the p53 pathway. The activity of candidate small molecule 
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modulating agents may be improved several-fold through iterative secondary functional 
validation, as further described below, structure determination, and candidate modulator 
modification and testing. Additionally, candidate clinical compounds are generated with 
specific regard to clinical and pharmacological properties. For example, the reagents may 
5 be derivatized and re-screened using in vitro and in vivo assays to optimize activity and 
minimize toxicity for pharmaceutical development. 

Protein Modulators 

Specific MARK-interacting proteins are useful in a variety of diagnostic and 
10 therapeutic applications related to the p53 pathway and related disorders, as well as in 
validation assays for other MARK-modulating agents. In a preferred embodiment, 
MARK-interacting proteins affect normal MARK function, including transcription, 
protein expression, protein localization, and cellular or extra-cellular activity. In another 
embodiment, MARK-interacting proteins are useful in detecting and providing 
15 information about the function of MARK proteins, as is relevant to p53 related disorders, 
such as cancer (e.g., for diagnostic means). 

An MARK-interacting protein may be endogenous, i.e. one that naturally interacts 
genetically or biochemically with a MARK, such as a member of the MARK pathway that 
modulates MARK expression, localization, and/or activity. MARK-modulators include 
20 dominant negative forms of MARK-interacting proteins and of MARK proteins 

themselves. Yeast two-hybrid and variant screens offer preferred methods for identifying 
endogenous MARK-interacting proteins (Finley, R. L. et al. (1996) in DNA Cloning- 
Expression Systems: A Practical Approach, eds. Glover D. & Hames B. D (Oxford 
University Press, Oxford, England), pp. 169-203; Fashema SF et al., Gene (2000) 250:1- 
25 14; Drees BL Curr Opin Chem Biol (1999) 3:64-70; Vidal M and Legrain P Nucleic Acids 
Res (1999) 27:919-29; and U.S. Pat. No. 5,928,868). Mass spectrometry is an alternative 
preferred method for the elucidation of protein complexes (reviewed in, e.g., Pandley A 
and Mann M, Nature (2000) 405:837-846; Yates JR 3 rd , Trends Genet (2000) 16:5-8). 
An MARK-interacting protein may be an exogenous protein, such as a MARK- 
30 specific antibody or a T-cell antigen receptor (see, e.g., Harlow and Lane (1988) 

Antibodies, A Laboratory Manual, Cold Spring Harbor Laboratory; Harlow and Lane 
(1999) Using antibodies: a laboratory manual. Cold Spring Harbor, NY: Cold Spring 
Harbor Laboratory Press). MARK antibodies are further discussed below. 

15 
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In preferred embodiments, a MARK-interacting protein specifically binds a MARK 
protein. In alternative preferred embodiments, a MARK-modulating agent binds a MARK 
substrate, binding partner, or cofactor. 

5 Antibodies 

In another embodiment, the protein modulator is a MARK specific antibody agonist or 
antagonist. The antibodies have therapeutic and diagnostic utilities, and can be used in 
screening assays to identify MARK modulators. The antibodies can also be used in 
dissecting the portions of the MARK pathway responsible for various cellular responses 
10 and in the general processing and maturation of the MARK 

Antibodies that specifically bind MARK polypeptides can be generated using known 
methods. Preferably the antibody is specific to a mammalian ortholog of MARK 
polypeptide, and more preferably, to human MARK. Antibodies may be polyclonal, 
monoclonal (mAbs), humanized or chimeric antibodies, single chain antibodies, Fab 
15 fragments, F(ab').sub.2 fragments, fragments produced by a FAb expression library, anti- 
idiotypic (anti-Id) antibodies, and epitope-binding fragments of any of the above. 
Epitopes of MARK which are particularly antigenic can be selected, for example, by 
routine screening of MARK polypeptides for antigenicity or by applying a theoretical 
method for selecting antigenic regions of a protein (Hopp and Wood (1981), Proc. Nati. 
20 Acad. Sci. U.S.A. 78:3824-28; Hopp and Wood, (1983) Mol. Immunol. 20:483-89; 

Sutcliffe et al., (1983) Science 219:660-66) to the amino acid sequence shown in any of 
SEQ ID NOs:24, 25, 26, 27, 28, or 29. Monoclonal antibodies with affinities of 10 8 M 1 
preferably 10 9 M 1 to 10 10 M 1 , or stronger can be made by standard procedures as 
described (Harlow and Lane, supra; Goding (1986) Monoclonal Antibodies: Principles 
25 and Practice (2d ed) Academic Press, New York; and U.S. Pat. Nos. 4,381,292; 4,451,570; 
and 4,618,577). Antibodies may be generated against crude cell extracts of MARK or 
substantially purified fragments thereof. If MARK fragments are used, they preferably 
comprise at least 10, and more preferably, at least 20 contiguous amino acids of a MARK 
protein. In a particular embodiment, MARK-specific antigens. and/or immunogens are 
30 coupled to carrier proteins that stimulate the immune response. For example, the subject 
polypeptides are covalendy coupled to the keyhole limpet hemocyanin (KLH) carrier, and 
the conjugate is emulsified in Freund's complete adjuvant, which enhances the immune 
response. An appropriate immune system such as a laboratory rabbit or mouse is 
immunized according to conventional protocols. 
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The presence of MARK-specific antibodies is assayed by an appropriate assay such as 
a solid phase enzyme-linked immunosorbant assay (ELBA) using immobilized 
corresponding MARK polypeptides. Other assays, such as radioimmunoassays or 
fluorescent assays might also be used. 
5 Chimeric antibodies'specific to MARK polypeptides can be made that contain 

different portions from different animal species. For instance, a human immunoglobulin 
constant region may be linked to a variable region of a murine mAb, such that the 
antibody derives its biological activity from the human antibody, and its binding 
specificity from me murine fragment. Chimeric antibodies are producedby splicing. 
10 together genes that encode the appropriate regions from each species (Morrison et al., 

Proc. Natl. Acad. Sci. (1984) 81:6851-6855; Neuberger et al., Nature (1984) 312:604-608; 
Takeda et al., Nature (1985) 31:452-454). Humanized antibodies, which are a form of 
chimeric antibodies, can be generated by grafting complementary-determining regions 
(CDRs) (Carlos, T. M., J. M. Harlan. 1994. Blood 84:2068-2101) of mouse antibodies 
15 into a background of human framework regions and constant regions by recombinant 
DMA technology (Riechmann 1M, et al., 1988 Nature 323: 323-327). Humanized 
antibodies contain -10% murine sequences and -90% human sequences, and thus further 
reduce or eliminate immunogenicity, while retaining the antibody specificities (Co MS, 
and Queen C. 1991 Nature 351: 501-501; Morrison SL. 1992 Ann. Rev. hnmun. 
20 10:239-265). Humanized antibodies and methods of their production are well-known in 
the art (U.S. Pat. Nos. 5,530,101, 5,585,089, 5,693,762, and 6,180,370). 

MARK-specific single chain antibodies which are recombinant, single chain 
polypeptides formed by Unking the heavy and light chain fragments of the Fv regions via 
an amino acid bridge, can be produced by methods known in the art (U.S. Pat. No. 
25 4,946,778; Bird, Science (1988) 242:423-426; Huston et al., Proc. Natl. Acad. Sci. USA 
(1988) 85:5879-5883; and Ward et al., Nature (1989) 334:544-546). 

Other suitable techniques for antibody production involve in vitro exposure of 
lymphocytes to the antigenic polypeptides or alternatively to selection of libraries of 
antibodies in phage or similar vectors (Huse et al., Science (1989) 246:1275-1281). As 
30 used herein, T-cell antigen receptors are included within the scope of antibody modulators 
(Harlow and Lane, 1988, supra). 

The polypeptides and antibodies of the present invention may be used with or without 
modification. Frequently, antibodies will be labeled by joining, either covalently or non- 
covalently, a substance that provides for a detectable signal, or that is toxic to cells that 
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express the targeted protein (Menard S, et al., Int J. Biol Markers (19S9) 4:131-134). A 
wide variety of labels and conjugation techniques are known and are reported extensively 
in both the scientific and patent literature. Suitable labels include radionuclides, enzymes, 
substrates, cofactors, inhibitors, fluorescent moieties, fluorescent emitting lanthanide 

5 metals, chemiluminescent moieties, bioluminescent moieties, magnetic particles, and the 
like (U.S. Pat. Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 4,275,149; 
and 4,366,241). Also, recombinant immunoglobulins may be produced (U.S. Pat. No. 
4,816,567). Antibodies to cytoplasmic polypeptides may be delivered and reach their 
targets by conjugation with membrane-penetrating toxin proteins (U.S . Pat. No. 

10 6,086,900). 

When used therapeutically in a patient, the antibodies of the subject invention are 
typically administered parenterally, when possible at the target site, or intravenously. The 
therapeutically effective dose and dosage regimen is determined by clinical studies. 
Typically, the amount of antibody administered is in the range of about 0.1 mg/kg -to 

15 about 10 mg/kg of patient weight. For parenteral administration, the antibodies are 
formulated in a unit dosage injectable form (e.g., solution, suspension, emulsion) in 
association with a pharmaceutically acceptable vehicle. Such vehicles are inherently 
nontoxic and non-therapeutic. Examples are water, saline, Ringer's solution, dextrose 
solution, and 5% human serum albumin. Nonaqueous vehicles such as fixed oils, ethyl 

20 oleate, or liposome carriers may also be used. The vehicle may contain minor amounts of 
additives, such as buffers and preservatives, which enhance isotonicity and chemical 
stability or otherwise enhance therapeutic potential. The antibodies' concentrations in 
such vehicles are typically in the range of about 1 mg/ml to aboutlO mg/ml. 
Immunotherapeutic methods are further described in the literature (US Pat. No. 5,859,206; 

25 WO0073469). 

Nucleic Acid Modulators 

Other preferred MARK-modulating agents comprise nucleic acid molecules, such as 
antisense oligomers or double stranded RNA (dsRNA), which generally inhibit MARK 
30 activity. Preferred nucleic acid modulators interfere with the function of the MARK 

nucleic acid such as DNA replication, transcription, translocation of the MARK RNA to 
the site of protein translation, translation of protein from the MARK RNA, splicing of the 
MARK RNA to yield one or more mRNA species, or catalytic activity which may be 
engaged in or facilitated by the MARK RNA. 
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In one embodiment, the antisense oligomer is an oligonucleotide that is sufficiently 
complementary to a MARK mRN A to bind to and prevent translation, preferably by 
binding to the 5' untranslated region. MARK-specific antisense oligonucleotides, 
preferably range from at least 6 to about 200 nucleotides. In some embodiments the 

5 oligonucleotide is preferably at least 10, 15, or 20 nucleotides in length. In other 
embodiments, the oligonucleotide is preferably less than 50, 40, or 30 nucleotides in 
length. The oligonucleotide can be DNA or RNA or a chimeric mixture or derivatives or 
modified versions thereof, single-stranded or double-stranded. The oligonucleotide can be 
modified at the base moiety, sugar moiety, or phosphate backbone. The oligonucleotide 

10 may include other appending groups such as peptides, agents that facilitate transport 
across the cell membrane, hybridization-triggered cleavage agents, and intercalating 
agents. 

In another embodiment, the antisense oligomer is a phosphothioate morpholino 
oligomer (PMO). PMOs are assembled from four different morpholino subunits, each of 
15 which contain one of four genetic bases (A, C, G, or T) linked to a six-membered 

morpholine ring. Polymers of these subunits are joined by non-ionic phosphodiamidate 
intersubuhit linkages. Details of how to make and use PMOs and other antisense 
oligomers are well known in the art (e.g. see W099/18193; Probst JC, Antisense 
Oligodeoxynucleotide and Ribozyme Design, Methods. (2000) 22(3):271-281; Summerton 
20 J, and Weller D. 1997 Antisense Nucleic Acid Drug Dev. :7:187-95; US Pat. No. 
5,235,033; and US Pat No. 5,378,841). 

Alternative preferred MARK nucleic acid modulators are double-stranded RNA 
species mediating RNA interference (RNAi). RNAi is the process of sequence-specific, 
post-transcriptional gene silencing in animals and plants, initiated by double-stranded 
25 RNA (dsRNA) that is homologous in sequence to the silenced gene. Methods relating to 
the use of RNAi to silence genes in C. elegans, Drosophila, plants, and humans are known 
in the art (Fire A, et al., 1998 Nature 391:806-811; Fire, A. Trends Genet. 15, 358-363 
(1999); Sharp, P. A. RNA interference 2001. Genes Dev. 15, 485-490 (2001); Hammond, 
S. M-, et al., Nature Rev. Genet. 2, 110-1119 (2001); Tuschl, T. Chem. Biochem. 2, 239- 
30 245 (2001); Hamilton, A. et al., Science 286, 950-952 (1999); Hammond, S. M., et al., 
Nature 404, 293-296 (2000); Zamore, P. D., et al., Cell 101, 25-33 (2000); Bernstein, E., 
et al., Nature 409, 363-366 (2001); Elbashir, S. M., et al., Genes Dev. 15, 188-200 
(2001); WO0129058; WQ9932619; Elbashir SM, et al., 2001 Nature 411:494-498). 
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Nucleic acid modulators are commonly used as research reagents* diagnostics, and 
therapeutics. For example, antisense oligonucleotides, which are able to inhibit gene 
expression with exquisite specificity, are often used to elucidate the function of particular 
genes (see, for example, U.S. Pat. No. 6,165,790). Nucleic acid modulators are also used, 
5 for example, to distinguish between functions of various members of a biological pathway. 
For example, antisense oligomers have been employed as therapeutic moieties in the 
treatment of disease states in animals and man and have been demonstrated in numerous 
clinical trials to be safe and effective (Milligan JF, et oZ, Current Concepts in Antisense 
Drug Design, J Med Chem. (1993) 36:1923-1937; TonMnson JL et al^ Antisense 

10 Oligodeoxynucleotides as Clinical Therapeutic Agents, Cancer Invest. (1996) 14:54-65). 
Accordingly, in one aspect of the invention, a MARK-specific nucleie acid modulator is 
used in an assay to further elucidate the role of the MARK in the p53 pathway, and/or its 
relationship to other members of the pathway. In another aspect of the invention, a 
MARK-specific antisense oligomer is used as a therapeutic agent for treatment of p53- 

15 related disease states. 

Assay Systems 

The invention provides assay systems and screening methods for identifying specific 
modulators of MARK activity. As used herein, an "assay system" encompasses all the 

20 components required for performing and analyzing results of an assay that detects and/or 
measures a particular event. In general, primary assays are used to identify or confirm a 
modulator's specific biochemical or molecular effect with respect to the MARK nucleic 
acid or protein. In general, secondary assays further assess the activity of a MARK 
modulating agent identified by a primary assay and may confirm that the modulating agent 

25 affects MARK in a manner relevant to the p53 pathway. In some cases, MARK 
modulators will be directly tested in a secondary assay. 

In a preferred embodiment, the screening method comprises contacting a suitable 
assay system comprising a MARK polypeptide with a candidate agent under conditions 
whereby, but for the presence of the agent, the system provides a reference activity (e.g. 

30 kinase activity), which is based on the particular molecular event the screening method 
detects. A statistically significant difference between the agent-biased activity and the 
reference activity indicates that the candidate agent modulates MARK activity, and hence 
the p53 pathway. 

20 
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Primary Assays 

The type of modulator tested generally determines the type of primary assay. 

Primary assays for small molecule modulators 
5 For small molecule modulators, screening assays are used to identify candidate 

modulators. Screening assays may be cell-based or may use a cell-free system that 
recreates or retains the relevant biochemical reaction of the target protein (reviewed in 
Sittampalam GS et al, Curr Opin Chem Biol (1997) 1:384-91 and accompanying 
references). As used herein the term "cell-based" refers to assays using live cells, dead 
10 cells, or a particular cellular fraction, such as a membrane, endoplasmic reticulum, or 
mitochondrial fraction. The term "cell free" encompasses assays using substantially 
purified protein (either endogenous or recombinantly produced), partially purified or crude 
cellular extracts. Screening assays may detect a variety of molecular events, includrng 
protein-DNA interactions, protein-protein interactions (e.g., receptor-ligand binding), 
15 transcriptional activity (e.g., using a reporter gene), enzymatic activity (e.g., via a property 
of the substrate), activity of second messengers, immunogenicty and changes in cellular 
morphology or other cellular characteristics. Appropriate screening assays may use a wide 
range of detection methods including fluorescent, radioactive, colorimetric, 
spectrophotometric, and amperometric methods, to provide a read-out for the particular 

20 molecular event detected. 

Cell-based screening assays usually require systems for recombinant expression of 
MARK and any auxiliary proteins demanded by the particular assay. Appropriate 
methods for generating recombinant proteins produce sufficient quantities of proteins that 
retain their relevant biological activities and are of sufficient purity to optimize activity 

25 and assure assay reproducibility . Yeast two-hybrid and variant screens, and mass 

spectrometry provide preferred methods for determining protein-protein interactions and 
elucidation of protein complexes. In certain applications, when MARK-interacting 
proteins are used in screens to identify small molecule modulators, the binding specificity 
of the interacting protein to the MARK protein may be assayed by various known methods 

30 such as substrate processing (e.g. ability of the candidate MARK-specific binding agents 
to function as negative effectors in MARK-expressing cells), binding equilibrium 
' constants (usually at least about 10 7 M 1 , preferably at least about 10 8 M"\ more preferably 
at least about 10 9 M 1 ), and immunogenicity (e.g. ability to elicit MARK specific antibody 
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in a heterologous host such as a mouse, rat, goat or rabbit). For enzymes and receptors, 
binding may be assayed by, respectively, substrate and ligand processing. 

The screening assay may measure a candidate agent's ability to specifically bind to or 
modulate activity of a MARKpolypepti.de, a fusion protein thereof, or to cells or 

5 membranes bearing the polypeptide or fusion protein. The MARK polypeptide can be Ml 
length or a fragment thereof that retains functional MARK activity. The MARK 
polypeptide may be fused to another polypeptide, such as a peptide tag for detection or 
anchoring, or to another tag. The MARK polypeptide is preferably human MARK, or is 
an ortholog or derivative thereof as described above. In a preferred embodiment, the 

10 screening assay detects candidate agent-based modulation of MARK interaction with a 
binding target, such as an endogenous or exogenous protein or other substrate that has 
MARK -specific binding activity, and can be used to assess normal MARK gene function. 

Suitable assay formats that may be adapted to screen for MARK modulators are 
known in the art. Preferred screening assays are high throughput or ultra high throughput 

15 and thus provide automated, cost-effective means of screening compound libraries for lead 
compounds (Fernandes PB, Curr Opin ChemBiol (1998) 2:597-603; Sundberg SA, Curr 
Opin Biotechnol 2000, 11:47-53). In one preferred embodiment, screening assays uses 
fluorescence technologies, including fluorescence polarization, time-resolved 
fluorescence, and fluorescence resonance energy transfer. These systems offer means to 

20 monitor protein-protein or DNA-protein interactions in which the intensity of the signal 
emitted from dye-labeled molecules depends upon their interactions with partner 
molecules (e.g., Selvin PR, Nat Struct Biol (2000) 7:730-4; Fernandes PB, supra; 
Hertzberg RP and Pope AJ, Curr Opin Chem Biol (2000) 4:445-451). 

A variety of suitable assay systems may be used to identify candidate MARK and p53 

25 pathway modulators (e.g. U.S. Pat.No. 6,165,992 (kinase assays); U.S. Pat. Nos. 

5,550,019 and 6,133,437 (apoptosis assays); U.S. Pat. No. 6,020,135 (p53 modulation), 
among others). Specific preferred assays are described in more detail below. 



Kinase assays. In some preferred embodiments the screening assay detects the ability 
30 of the test agent to modulate the kinase activity of a MARK polypeptide. In further 
embodiments, a cell-free kinase assay system is used to identify a candidate p53 
modulating agent, and a secondary, cell-based assay, such as an apoptosis or hypoxic 
induction assay (described below), may be used to further characterize the candidate p53 
modulating agent. Many different assays for kinases have been reported in the literature 
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and are well known to those skilled in the art (e.g. U.S. Pat. No. 6,165,992; Zhu et aL, 
Nature Genetics (2000) 26:283-289; and WO0073469). Radioassays, which monitor the 
transfer of a gamma phosphate are frequently used. For instance, a scintillation assay for 
p56 (tek) kinase activity monitors the transfer of the gamma phosphate from gamma - 33 P 

5 ATP to a biotinylated peptide substrate; the substrate is captured on a streptavidin coated 
bead that transmits the signal (Beveridge M et aL, J Biomol Screen (2000) 5:205-212). 
This assay uses the scintillation proximity assay (SPA), in which only radio-ligand bound 
to receptors tethered to the surface of an SPA bead are detected by the scintillant 
immobilized within it, allowing binding to be measured without separation of bound from 

10 free ligand. 

Other assays for protein kinase activity may use antibodies that specifically recognize 
phosphorylated substrates. For instance, the kinase receptor activation (KLRA) assay 
measures receptor tyrosine kinase activity by ligand stimulating the intact receptor in 
cultured cells, then capturing solubilized receptor with specific antibodies and quantifying 
15 phosphorylation via phosphotyrosine ELISA (Sadick MD, Dev Biol Stand (1999) 97: 121- 
133). 

Another example of antibody based assays for protein kinase activity is TRF (time- 
resolved fluorometry). This method utilizes europium chelate-labeled anti- 
phosphotyrosine antibodies to detect phosphate transfer to a polymeric substrate coated 
20 onto microliter plate wells. The amount of phosphorylation is then detected using time- 
resolved, dissociation-enhanced fluorescence (Braunwalder AF, et aL, Anal Biochem 1996 
Jul l;238(2):159-64). 



Apoptosis assays. Assays for apoptosis may be performed by terminal 
25 deoxynucleotidyl transferase-mediated digoxigenin-ll-dUTP nick end labeling (TUNEL) 
assay. The TUNEL assay is used to measure nuclear DNA fragmentation characteristic of 
apoptosis ( Lazebnik et aL, 1994, Nature 371, 346), by following the incorporation of 
fluorescein-dUTP (Yonehara et aL, 1989, J. Exp. Med. 169, 1747). Apoptosis may further 
be assayed by acridine orange staining of tissue culture cells (Lucas, R., et aL, 1998, Blood 
30 15 :4730-41). An apoptosis assay system may comprise a cell that expresses a MARK, and 
that optionally has defective p53 function (e.g. p53 is over-expressed or under-expressed 
relative to wild-type cells). A test agent can be added to the apoptosis assay system and 
changes in induction of apoptosis relative to controls where no test agent is added, identify 
candidate p53 modulating agents. In some embodiments of the invention, an apoptosis 
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assay may be used as a secondary assay to test a candidate p53 modulating agents that is 
initially identified using a cell-free assay system. An apoptosis assay may also be used to 
test whether MARK function plays a direct role in apoptosis. For example, an apoptosis 
assay may be performed on cells that over- or under-express MARK relative to wild type 
5 cells. Differences in apoptotic response compared to wild type cells suggests that the 
MARK plays a direct role in the apoptotic response. Apoptosis assays are described 
further in US Pat. No. 6,133,437. 

Cell proliferation and cell cycle assays. Cell proliferation may be assayed via 
10 bromodeoxyuridine (BRDU) incorporation. This assay identifies a cell population 
undergoing DNA synthesis by incorporation of BRDU into newly-synthesized DNA. 
Newly-synthesized DNA may then be detected using an anti-BRDU antibody (Hoshino et 
aU 1986, Int. J. Cancer 38, 369; Campana et ah, 1988, J. Immunol. Meth. 107, 79), or by 
other means. 

15 Cell Proliferation may also be examined using [ 3 H]-thymidine incorporation (Chen, J., 

1996, Oncogene 13:1395-403; Jeoung, J., 1995, J. Biol. Chem. 270:18367-73). This 
assay allows for quantitative characterization of S-phase DNA syntheses. In this assay, 
cells synthesizing DNA will incorporate [ 3 H] -thymidine into newly synthesized DNA. 
Incorporation can then be measured by standard techniques such as by counting of 

20 radioisotope in a scintillation counter (e.g., Beckman LS 3800 Liquid Scintillation 
Counter). 

Cell proliferation may also be assayed by colony formation in soft agar (Sambrook et 
al., Molecular Cloning, Cold Spring Harbor (1989)). For example, cells transformed with 
MARK are seeded in soft agar plates, and colonies are measured and counted after two 

25 weeks incubation. 

Involvement of a gene in the cell cycle may be assayed by flow cytometry (Gray JW et 
al. (1986) Int J Radiat Biol Relat Stud Phys Chem Med 49:237-55). Cells transfected with 
a MARK may be stained with propidium iodide and evaluated in a flow cytometer 
(available from Becton Dickinson). 

30 Accordingly, a cell proliferation or cell cycle assay system may comprise a cell that 

expresses a MARK, and that optionally has defective p53 function (e.g. p53 is over- 
expressed or under-expressed relative to wild-type cells). A test agent can be added to the 
assay system and changes in cell proliferation or cell cycle relative to controls where no 
test agent is added, identify candidate p53 modulating agents. In some embodiments of 
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the invention, the cell proliferation or cell cycle assay may be used as a secondary assay to 
test a candidate p53 modulating agents that is initially identified using another assay 
system such as a cell-free kinase assay system. A cell proliferation assay may also be used 
to test whether MARK function plays a direct role in cell proliferation or cell cycle. For 
5 example, a cell proliferation or cell cycle assay may be performed on cells that over- or 
under-express MARK relative to wild type cells. Differences in proliferation or cell cycle 
compared to wild type cells suggests that the MARK plays a direct role in cell 
proliferation or cell cycle. 

10 Angiogenesis. Angiogenesis may be assayed using various human endothelial cell 

systems, such as umbilical vein, coronary artery, or dermal cells. Suitable assays include 
Alamar Blue based assays (available from Biosource International) to measure 
proliferation; migration assays using fluorescent molecules, such as the use of Becton 
Dickinson Falcon HTS FluoroBlock cell culture inserts to measure migration of cells 
15 through membranes in presence or absence of angiogenesis enhancer or suppressors; and 
tubule formation assays based on the formation of tubular structures by endothelial cells 
on Matrigel® (Becton Dickinson). Accordingly, an angiogenesis assay system may 
comprise a cell that expresses a MARK and that optionally has defective P 53 function 
(e.g. p53 is over-expressed or under-expressed relative to wild-type cells). A test agent 
20 can be added to the angiogenesis assay system and changes in angiogenesis relative to 

controls where no test agent is added, identify candidate p53 modulating agents. In some 
embodiments of the invention, the angiogenesis assay may be used as a secondary assay to 
test a candidate p53 modulating agents that is initially identified using another assay 
system. An angiogenesis assay may also be used to test whether MARK function plays a 
25 direct role in cell proliferation. For example, an angiogenesis assay may be performed on 
cells that over- or under-express MARK relative to wild type cells. Differences in 
angiogenesis compared to wild type cells suggests that the MARK plays a direct role in 
angiogenesis. 

30 Hypoxic induction. The alpha subunit of the transcription factor, hypoxia inducible 

factor-1 (HIF-1), is upregulated in tumor cells following exposure to hypoxia in vitro. 
Under hypoxic conditions, HDDF-1 stimulates the expression of genes known to be 
important in tumour cell survival, such as those encoding glyolytic enzymes and VEGF. 
Induction of such genes by hypoxic conditions may be assayed by growing cells 
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transfected with MARK in hypoxic conditions (such as with 0.1% 02, 5% C02, and 
balance N2, generated in a Napco 7001 incubator (Precision Scientific)) and nonnoxic 
conditions, followed by assessment of gene activity or expression by Taqman®. For 
example, a hypoxic induction assay system may comprise a cell that expresses a MARK, 

5 and that optionally has a mutated p53 (e.g. p53 is over-expressed or under-expressed 
relative to wild-type cells). A test agent can be added to the hypoxic induction assay 
system and changes in hypoxic response relative to controls where no test agent is added, 
identify candidate p53 modulating agents. In some embodiments of the invention, the 
hypoxic induction assay may be used as a secondary assay to test a candidate p53 
10 modulating agents that is initially identified using another assay system. A hypoxic 

induction assay may also be used to test whether MARK function plays a direct role in the 
hypoxic response. For example, a hypoxic, induction assay may be performed on cells that 
over- or under-express MARK relative to wild type cells. Differences in hypoxic response 
compared to wild type cells suggests that the MARK plays a direct role in hypoxic 

15 induction. 

Cell adhesion. Cell adhesion assays measure adhesion of cells to purified adhesion 
proteins, or adhesion of cells to each other, in presence or absence of candidate 
modulating agents. Cell-protein adhesion assays measure the ability of agents to modulate 
20 the adhesion of cells to purified proteins. For example, recombinant proteins are 

produced, diluted to 2.5g/mL in PBS, and used to coat the wells of a microliter plate. The 
wells used for negative control are not coated. Coated wells are then washed, blocked 
with 1% BSA, and washed again. Compounds are diluted to 2x final test concentration 
and added to the blocked, coated wells. Cells are then added to the wells, and the unbound 
25 cells are washed off. Retained cells are labeled directly on the plate by adding a 

membrane-permeable fluorescent dye, such as calcein-AM, and the signal is quantified in 
a fluorescent microplate reader. 

Cell-cell adhesion assays measure the ability of agents to modulate binding of cell 
adhesion proteins with their native ligands. These assays use cells that naturally or 
30 recombinantly express the adhesion protein of choice. In an exemplary assay, cells 
expressing the cell adhesion protein are plated in wells of a multiwell plate. Cells 
expressing the ligand aTe labeled with a membrane-permeable fluorescent dye, such as 
BCECF , and allowed to adhere to the monolayers in the presence of candidate agents. 
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Unbound cells are washed off, and bound cells are detected using a fluorescence plate 
reader. 

High-throughput cell adhesion assays have also been described. In one such assay, 
small molecule ligands and peptides are bound to the surface of microscope slides using a 
5 microarray spotter, intact cells are then contacted with the slides, and unbound cells are 
washed off. In this assay, not only the binding specificity of the peptides and modulators 
against cell lines are determined, but also the functional cell signaling of attached cells 
using immunofluorescence techniques in situ on the microchip is measured (Falsey JR et 
al., Bioconjug Chem. 2001 May-Jun;12(3):346-53). 

10 

Primary assays for antibody modulators 

For antibody modulators, appropriate primary assays test is a binding assay that tests 
the antibody's affinity to and specificity for the MARK protein. Methods for testing 
antibody affinity and specificity are well known in the art (Harlow and Lane, 1988, 1999, 
15 supra). The enzyme-linked immunosorbant assay (ELISA) is a preferred method for 

detecting MARK-specific antibodies; others include FACS assays, radioimmunoassays, 
and fluorescent assays. 

Primary assays for nucleic acid modulators 
20 For nucleic acid modulators, primary assays may test the ability of the nucleic acid 

modulator to inhibit or enhance MARK gene expression, preferably mKNA expression. In 
general, expression analysis comprises comparing MARK expression in like populations 
of cells (e.g. , two pools of cells that endogenously or recombinantly express MARK) in 
the presence and absence of the nucleic acid modulator. Methods for analyzing mRNA 
25 and protein expression are well known in the art. For instance, Northern blotting, slot 
blotting, ribonuclease protection, quantitative RT-PCR (e.g., using the TaqMan®, PE 
Applied Biosystems), or microarray analysis may be used to confirm that MARK mRNA 
expression is reduced in cells treated with the nucleic acid modulator (e.g., Current 
Protocols in Molecular Biology (1994) Ausubel FM et al, eds., John Wiley & Sons, Inc., 
30 chapter 4; Freeman WM et al, Biotechniques (1999) 26:112-125; Kallioniemi OP, Ann 
Med 2001, 33:142-147; Blohm DH and Guiseppi-Elie, A Curr Opin Biotechnol 2001, 
12:41-47). Protein expression may also be monitored. Proteins are most commonly ' 
detected with specific antibodies or antisera directed against either the MARK protein or 
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specific peptides. A variety of means including Western blotting, EL1SA, or in situ 
detection, are available (Harlow E and Lane D, 1988 and 1999, supra). 



Secondary Assays 

5 Secondary assays may be used to further assess the activity of MAJRK-modulating 

agent identified by any of the above methods to confirm that the modulating agent affects 
MARK in a manner relevant to the p53 pathway. As used herein, MARIC-modulating 
agents encompass candidate clinical compounds or other agents derived from previously 
identified modulating agent. Secondary assays can also be used to test the activity of a 

10 modulating agent on a particular genetic or biochemical pathway or to test the specificity 
of the modulating agent's interaction with MARK. 

Secondary assays generally compare like populations of cells or animals (e.g., two 
pools of cells or animals that endogenousiy or recombinantly express MARK) in the 
presence and absence of the candidate modulator. In general, such assays test whether 

15 treatment of cells or animals with a candidate MARK-modulating agent results in changes 
in the p53 pathway in comparison to untreated (or mock- or placebo-treated) cells or 
animals. Certain assays use "sensitized genetic backgrounds", which, as used herein, 
describe cells or animals engineered for altered expression of genes in the p53 or 
interacting pathways. 

20 

Cell-based assays 

Cell based assays may use a variety of mammalian cell lines known to have defective 
p53 function (e.g. SAOS-2 osteoblasts, H1299 lung cancer cells, C33A and HT3 cervical 
cancer cells, HT-29 and DLD-1 colon cancer cells, among others, available from 
25 American Type Culture Collection (ATCC), Manassas, VA). Cell based assays may 
detect endogenous p53 pathway activity or may rely on recombinant expression of p53 
pathway components. Any of the aforementioned assays may be used in this cell-based 
format. Candidate modulators are typically added to the cell media but may also be 
injected into cells or delivered by any other efficacious means. 

30 

Animal Assays 

A variety of non-human animal models of normal or defective p53 pathway may be 
used to test candidate MARK modulators. Models for defective p53 pathway typically use 
genetically modified animals that have been engineered to mis-express (e.g., over-express 
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or lack expression in) genes involved in the p53 pathway. Assays generally require 
systemic delivery of the candidate modulators, such as by oral administration, injection, 
etc. 

In a preferred embodiment, p53 pathway activity is assessed by monitoring 

5 neovascularization and angiogenesis. Animal models with defective and normal p53 are 
used to test the candidate modulator's affect on MARK in Matrigel® assays. Matrigel® is 
an extract of basement membrane proteins, and is composed primarily of laminin, collagen 
IV, and heparin sulfate proteoglycan. It is provided as a sterile liquid at 4° C, but rapidly 
forms a solid gel at 37° C. Liquid Matrigel® is mixed with various angiogenic agents, 
10 such as bFGF and VEGF, or with human tumor cells which over-express the MARK. The 
mixture is then injected subcutaneously(SC) into female athymic nude mice (Taconic, 
Germantown, NY) to support an intense vascular response. Mice with Matrigel® pellets 
may be dosed via oral (PO), intraperitoneal (BP), or intravenous (IV) routes with the 
candidate modulator. Mice are euthanized 5-12 days post-injection, and the Matrigel® 

15 pellet is harvested for hemoglobin analysis (Sigma plasma hemoglobin kit). Hemoglobin 
content of the gel is found to correlate the degree of neovascularization in the gel. 

In another preferred embodiment, the effect of the candidate modulator on MARK is 
assessed via tumorigenicity assays. In one example, xenograft human tumors are 
implanted SC into female athymic mice, 6-7 week old, as single cell suspensions either 

20 from a pre-existing tumor or from in vitro culture. The tumors which express the MARK 
endogenously are injected in the flank, 1 x 10 5 to 1 x 10 7 cells per mouse in a volume of 
100 fiL using a 27 gauge needle. Mice are then ear tagged and tumors are measured twice 
weekly. Candidate modulator treatment is initiated on the day the mean tumor weight 
reaches 100 mg. Candidate modulator is delivered IV, SC, BP, or PO by bolus 

25 administration. Depending upon the pharmacokinetics of each unique candidate 

modulator, dosing can be performed multiple times per day. The tumor weight is assessed 
by measuring perpendicular diameters with a caliper and calculated by multiplying the 
measurements of diameters in two dimensions. At the end of the experiment, the excised 
tumors maybe utilized for biomarker identification or further analyses. For 

30 immunohistochemistry staining, xenograft tumors are fixed in 4% paraformaldehyde, 

0.1M phosphate, pH 7.2, for 6 hours at 4°C, immersed in 30% sucrose in PBS, and rapidly 
frozen in isopentane cooled with liquid nitrogen. 
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Diagnostic and therapeutic uses 

Specific MARK-modulating agents are useful in a variety of diagnostic and 
therapeutic applications where disease or disease prognosis is related to defects in the p53 
pathway, such as angiogenic, apoptotic, or cell proliferation disorders. Accordingly, the 
5 invention also provides methods for modulating the p53 pathway in a cell, preferably a 
cell pre-determined to have defective p53 function, comprising the step of administering 
an agent to the cell that specifically modulates MARK activity. Preferably, the 
modulating agent produces a detectable phenotypic change in the cell indicating that the 
p53 function is restored, i.e., for example, the cell undergoes normal proliferation or 
10 progression through the cell cycle. 

The discovery that MARK is implicated in p53 pathway provides for a variety of 
methods that can be employed for the diagnostic and prognostic evaluation of diseases and 
disorders involving defects in the p53 pathway and for the identification of subjects having 
a predisposition to such diseases and disorders. 
15 Various expression analysis methods can be used to diagnose whether MARK 

expression occurs in a particular sample, including Northern blotting, slot blotting, 
ribonuclease protection, quantitative RT-PCR, and microarray analysis, (e.g., Current 
Protocols in Molecular Biology (1994) Ausubel FM et al, eds., John Wiley & Sons, Inc., 
chapter 4; Freeman WM et al, Biotechniques (1999) 26:112-125; KalHoniemi OP, Ann 
20 Med 2001, 33:142-147; Blohm and Guiseppi-Elie, Curr Opin Biotechnol 2001, 12:41-47). 
Tissues having a disease or disorder implicating defective p53 signaling that express a 
MARK, are identified as amenable to treatment with a MARK modulating agent. In a 
preferred application, the p53 defective tissue overexpresses a MARK relative to normal 
tissue. For example, a Northern blot analysis of mRNA from tumor and normal cell lines, 
25 or from tumor and matching normal tissue samples from the same patient, using full or 
partial MARK cDNA sequences as probes, can determine whether particular tumors 
express or overexpress MARK. Alternatively, the TaqMan® is used for quantitative RT- 
PCR analysis of MARK expression in cell lines, normal tissues and tumor samples (PE 
Applied Biosystems). 

30 Various other diagnostic methods may be performed, for example, utilizing reagents 

such as the MARK oligonucleotides, and antibodies directed against a MARK, as 
described above for: (1) the detection of the presence of MARK gene mutations, or the 
detection of either over- or under-expression of MARK mRNA relative to the non- 
disorder state; (2) the detection of either an over- or an under-abundance of MARK gene 
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product relative to the non-disorder state; and (3) the detection of perturbations or 
abnormalities in the signal transduction pathway mediated by MARK. 

Thus, in a specific embodiment, the invention is drawn to a method for diagnosing a 
disease in a patient, the method comprising: a) obtaining a biological sample from the 
5 patient; b) contacting the sample with a probe for MARK expression; c) comparing 
results from step (b) with a control; and d) detennining whether step (c) indicates a 
likelihood of disease. Preferably, the disease is cancer, most preferably a cancer as shown 
in TABLE 1. The probe may be either DNA or protein, including an antibody. 

EXAMPLES 

The following experimental section and examples are offered by way of illustration 
and not by way of limitation. 

I. Drosophila p53 screen 

The Drosophila p53 gene was overexpressed specifically in the wing using the 
vestigial margin quadrant enhancer. Increasing quantities of Drosophila p53 (titrated 
using different strength transgenic inserts in 1 or 2 copies) caused deterioration of normal 
wing morphology from mild to strong, with phenotypes including disruption of pattern and 
polarity of wing hairs, shortening and thickening of wing veins, progressive crumpling of 
the wing and appearance of dark "death" inclusions in wing blade. In a screen designed to 
identify enhancers and suppressors of Drosophila p53, homozygous females carrying two 
copies of p53 were crossed to 5663 males carrying random insertions of a piggyBac 
transposon (Fraser M et aU Virology (1985) 145:356-361). Progeny containing insertions 
were compared to non-insertion-bearing sibling progeny for enhancement or suppression 
of the p53 phenotypes. Sequence information surrounding the piggyBac insertion site was 
used to identify the modifier genes. Modifiers of the wing phenotype were identified as 
members of the p53 pathway. kp78a was a suppressor of the wing phenotype. Human 
orthologs of the modifiers are referred to herein as MARK. 

BLAST analysis (Altschul et aL, supra) was employed to identify Targets from 
Drosophila modifiers. For example, representative sequences from MARK GI# 9845487 
(SEQ ID NO:24 ), GI# 8923922(SEQ ID NO:25), GI# 4505103 (SEQ ID NO:27), and 
GI#13899225 (SEQ ID NO:29) share 43%, 65%, 65% and 45% amino acid identity , 
respectively, with the Drosophila kp78a. 
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Various domains, signals, and functional subunits in proteins were analyzed using the 
PSORT (Nakai K., and Horton P., Trends Biochem Sci, 1999, 24:34-6; Kenta Nakai, 
Protein sorting signals and prediction of subcellular localization, Adv. Protein Chem. 54, 
277-344 (2000)), PFAM (Bateman A., et al., Nucleic Acids Res, 1999, 27:260-2; 

5 htt p://pfam.wustl.edu), SMART (Poncing CP, et al., SMART: identification and annotation 
of domains from signaling and extracellular protein sequences. Nucleic Acids Res. 1999 
Jan l;27(l):229-32), TM-HMM (ErikL.L. Sonnhammer, Gunnar von Heijne, and Anders 
Krogh: A hidden Markov model for predicting transmembrane helices in protein 
sequences. In Proc. of Sixth Int. Conf . on Intelligent Systems for Molecular Biology, p 

10 175-1 82 Ed J. Glasgow, T. Littlejohn, F, Major, R Lafhrop, D. Sankoff, and C. Sensen 
Menlo Park, CA: AAAI Press, 1998), and clust (Remm M, and Sonnhammer E. 
Classification of transmembrane protein families in the Caenorhabditis elegans genome 
and identification of human orthologs. Genome Res. 2000 Nov;10(ll):1679-89) programs. 
For example, the proten kinase domains of MARKs from GI#s 9845487 (SEQ ID NO:24), 

15 8923922 (SEQ ID NO:25), 4505103 (SEQ ID NO:27), and 13899225 (SEQ ID NO:29) is 
located at approximately amino acid residues 20 to 271, 60 to 311, 56 to 307, and 59 to 
310, respectively (PFAM 00069). Further, the ubiquitin associated (UBA/TS-N) domains 
of MARKs from GI#s 9845487 (SEQ ID NO:24), 8923922 (SEQ ID NO:25), 4505103 
(SEQ ID NO:27), and 13899225 (SEQ ID NO:29) is located at approximately amino acid 

20 residues 291 to 330, 331 to 370, 327 to 366, and 330 to 369, respectively (PFAM 00627). 
Still further, the kinase associated domains from MARKs of GI#s9845487 (SEQ ID 
NO:24), 8923922 (SEQ ID NO:25), and 4505103 (SEQ ID NO:27) are located at 
approximately amino acid residues 696 to 745, 746 to 795, and 664 to 713, respectively 
(PFAM 02149). 

25 

II. High-Throughput In Vitro Fluorescence Polari zation Assay 
Fluorescently-labeled MARK peptide/substrate are added to each well of a 96-well 
microliter plate, along with a test agent in a test buffer (10 mM HEPES, 10 mM NaCl, 6 
mM magnesium chloride, pH 7.6). Changes in fluorescence polarization, determined by 
30 using a Fluorolite FPM-2 Fluorescence Polarization Microtiter System (Dynatech 

Laboratories, Inc), relative to control values indicates the test compound is a candidate 
modifier of MARK activity. 
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m. High-Throughput In Vitro Binding Assay. 
33 P-labeled MARK peptide is added in an assay buffer (100 mM KC1, 20 raM HEPES 
pH 7.6, 1 mM MgCl 2 , 1% glycerol, 0.5% NP-40, 50 mM beta-mercaptoethanol, 1 mg/ml 
BSA, cocktail of protease inhibitors) along with a test agent to the wells of a Neutralite- 
5 avidin coated assay plate and incubated at 25°C for 1 hour. Biotinylated substrate is then 
added to each well and incubated for 1 hour. Reactions are stopped by washing with PBS, 
and counted in a scintillation counter. Test agents that cause a difference in activity 
relative to control without test agent are identified as candidate p53 modulating agents. 

10 IV. Tmmnnop recipitations and Immunoblotting 

For coprecipitation of transfected proteins, 3 x 10 6 appropriate recombinant cells 
containing the MARK proteins are plated on 10-cm dishes and transfected on the 
following day with expression constructs. The total amount of DNA is kept constant in 
each transfection by adding empty vector. After 24 h, cells are collected, washed once 

15 with phosphate-buffered saline and lysed for 20 min on ice in 1 ml of lysis buffer 
containing 50 mM Hepes, pH 7.9, 250 mM NaCl, 20 mM -glycerophosphate, 1 mM 
sodium orthovanadate, 5 mM p-nitrophenyl phosphate, 2 mM dithiothreitol, protease 
inhibitors (complete, Roche Molecular Biochemicals), and 1% Nonidet P-40. Cellular 
debris is removed by centrifugation twice at 15,000 x g for 15 min. The cell lysate is 

20 incubated with 25 pi of M2 beads (Sigma) for 2 h at 4 °C with gentle rocking. 

After extensive washing with lysis buffer, proteins bound to the beads are solubilized 
by boiling in SDS sample buffer, fractionated by SDS-polyacrylamide gel electrophoresis, 
transferred to poly vinylidene difluoride membrane and blotted with the indicated 
antibodies. The reactive bands are visualized with horseradish peroxidase coupled to the 

25 appropriate secondary antibodies and the enhanced chemiluminescence (ECL) Western 
blotting detection system (Amersham Pharmacia Biotech). 

V. Kinase assay 

A purified or partially purified MARK is diluted in a suitable reaction buffer, e.g., 50 
30 mM Hepes, pH 7.5, containing magnesium chloride or manganese chloride (1-20 mM) and 
a peptide or polypeptide substrate, such as myelin basic protein or casein (1-10 jxg/ml). 
The final concentration of the kinase is 1-20 nM. The enzyme reaction is conducted in 
microliter plates to facilitate optimization of reaction conditions by increasing assay 
throughput. A 96- well microliter plate is employed using a final volume 30-100 fi\. The 
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reaction is initiated by the addition of 33 P-gamma~ATP (0.5 /xCi/rol) and incubated for 0.5 
to 3 hours at room temperature. Negative controls are provided by the addition of EDTA, 
which chelates the divalent cation (Mg 2 * or Mn 2+ ) required for enzymatic activity. 
Following the incubation, the enzyme reaction is quenched using EDTA. Samples of the 
5 reaction are transferred to a 96-well glass fiber filter plate (MultiScreen, Millipore). The 
filters are subsequently washed with phosphate-buffered saline, dilute phosphoric acid 
(0.5%) or other suitable medium to remove excess radiolabeled ATP. Scintillation 
cocktail is added to the filter plate and the incorporated radioactivity is quantitated by 
scintillation counting (WaHac/Perkin Elmer). Activity is defined by the amount of 
10 radioactivity detected following subtraction of the negative control reaction value (EDTA 
quench). 

VI. Expression analysis 
All cell lines used in the following experiments are NCI (National Cancer Institute) 
15 lines, and are available from ATCC (American Type Culture Collection, Manassas, VA 
20110-2209). Normal and tumor tissues were obtained from Impath, UC Davis, Clontech, 
Stratagene, and Ambion. 

TaqMan analysis was used to assess expression levels of the disclosed genes in various 
samples. 

20 RNA was extracted from each tissue sample using Qiagen (Valencia, CA) RNeasy 

kits, following manufacturer's protocols, to a final concentration of 50ng/|xl. Single 
stranded cDNA was then synthesized by reverse transcribing the RNA samples using 
random hexamers and 500ng of total RNA per reaction, following protocol 4304965 of 
Applied Biosystems (Foster City, CA, http://www.apphedbiosvstems.coin/ ). 

25 Primers for expression analysis using TaqMan assay (Applied Biosystems, Foster City, 

CA) were prepared according to the TaqMan protocols, and the following criteria: a) 
primer pairs were designed to span introns to eliminate genomic contamination, andb) 
each primer pair produced only one product. 

Taqman reactions were earned out following manufacturer's protocols, in 25 pi total 

30 volume for 96-well plates and 10 jjI total volume for 384-well plates, using 300nM primer 
and 250 nM probe, and approximately 25ng of cDNA. The standard curve for result 
analysis was prepared using a universal pool of human cDNA samples, which is a mixture 
of cDNAs from a wide variety of tissues so that the chance that a target will be present in 
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appreciable amounts is good. The raw data were normalized using 18S rRNA (universally 
expressed in all tissues and cells). 

For each expression analysis, tumor tissue samples were compared with matched 
normal tissues from the same patient. A gene was considered overexpressed in a tumor 
5 when the level of expression of the gene was 2 fold or higher in the tumor compared with 
its matched normal sample. 

Results are shown in Table 1 . Data presented in bold indicate that greater than 50% of 
tested tumor samples of the tissue type indicated in row 1 exhibited over expression of the 
gene listed in column 1, relative to normal samples. Underlined data indicates that 
10 between 25% to 49% of tested tumor samples exhibited over expression. A modulator 
identified by an assay described herein can be further validated for therapeutic effect by 
administration to a tumor in which the gene is overexpressed. A decrease in tumor growth 
confirms therapeutic utility of the modulator. Prior to treating a patient with the 
modulator, the likelihood that the patient will respond to treatment can be diagnosed by 
15 obtaining a tumor sample from the patient, and assaying for expression of the gene 
targeted by the modulator. The expression data for the gene(s) can also be used as a 
diagnostic marker for disease progression. The assay can be performed by expression 
analysis as described above, by antibody directed to the gene target, or by any other 
available detection method. 

20 

Table 1 





breast 






colon 






lung 






ovarv 


















GW9845486 (SEQ ID NO:l) 


7 


11 




8 


30 




8 


13 




5 


7 


GI#9845488 (SEQ ID NO:2) 


1 


11 




4 


30 




0 


13 




1 


7 


GI#8923921(SEQ ID NO:8) 


2 


11 




7 


30 




6 


13 




0 


7 


GI#3089348 (SEQ ID NO:13) 


2 


11 




2 


30 




0 


13 




2 


7 


GI#13366083 (SEQ ID NO: 19) 


2 


11 




2 


30 




5 


13 




1 


7 
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1. A method of identifying a candidate p53 pathway modulating agent, said method 
comprising the steps of: 

5 (a) providing an assay system comprising a purified MARK polypeptide or nucleic 

acid or a functionally active fragment or derivative thereof; 

(b) contacting the assay system with a test agent under conditions whereby, but for 
the presence of the test agent, the system provides a reference activity; and 

(c) detecting a test agent-biased activity of the assay system, wherein a difference 
10 between the test agent-biased activity and the reference activity identifies the test agent as 

a candidate p53 pathway modulating agent. 

2. The method of Claim 1 wherein the assay system comprises cultured cells that 
express the MARK polypeptide. 

15 

3. The method of Claim 2 wherein the cultured cells additionally have defective p53 
function. 

4. The method of Claim 1 wherein the assay system includes a screening assay 
20 comprising a MARK polypeptide, and the candidate test agent is a small molecule 

modulator. 

5. The method of Claim 4 wherein the assay is a kinase assay. 

25 6. The method of Claim 1 wherein the assay system is selected from the group 

consisting of an apoptosis assay system, a cell proliferation assay system, an angiogenesis 
assay system, and a hypoxic induction assay system. 

7. The method of Claim 1 wherein the assay system includes a binding assay 
30 comprising a MARK polypeptide and the candidate test agent is an antibody. 

8. The method of Claim 1 wherein the assay system includes an expression assay 
comprising a MARK nucleic acid and the candidate test agent is a nucleic acid modulator. 
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9. The method of claim 8 wherein the nucleic acid modulator is an antisense 
oligomer. 

10. The method of Claim 8 wherein the nucleic acid modulator is a PMO. 

5 

11. The method of Claim 1 additionally comprising: 

(d) administering the candidate p53 pathway modulating agent identified in (c) to a 
model system comprising cells defective in p53 function and, detecting a phenotypic 
change in the model system that indicates that the p53 function is restored. 

10 

12. The method of Claim 1 1 wherein the model system is a mouse model with 
defective p53 function. 

13. A method for modulating a p53 pathway of a cell comprising contacting a cell 
15 defective in p53 function with a candidate modulator that specifically binds to a MARK 

polypeptide comprising an amino acid sequence selected from group consisting of SEQ ID 
NOs:24, 25, 26, 27, 28, and 29, whereby p53 function is restored. 

14. The method of claim 13 wherein the candidate modulator is administered to a 
20 vertebrate animal predetermined to have a disease or disorder resulting from a defect in 

p53 function. 

15. The method of Claim 13 wherein the candidate modulator is selected from the 
group consisting of an antibody and a small molecule. 

25 

16. The method of Claim 1 , comprising the additional steps of: 

(d) providing a secondary assay system comprising cultured cells or a non-human 
animal expressing MARK , 

(e) contacting the secondary assay system with the test agent of (b) or an agent 

30 derived therefrom under conditions whereby, but for the presence of the test agent or agent 
derived therefrom, the system provides a reference activity; and 

(f) detecting an agent-biased activity of the second assay system, 
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wherein a difference between the agent-biased activity and the reference activity of the 
second assay system confirms the test agent or agent derived therefrom as a candidate p53 
pathway modulating agent, 

and wherein the second assay detects an agent-biased change in the p53 pathway. 

17. The method of Claim 16 wherein the secondary assay system comprises cultured 
cells. 



18. The method of Claim 16 wherein the secondary assay system comprises a non- 
10 human animal. 

19. The method of Claim 18 wherein the non-human animal mis-expresses a p53 
pathway gene. 

15 20. A method of modulating p53 pathway in a mammalian cell comprising contacting 
the cell with an agent that specifically binds a MARK polypeptide or nucleic acid. 

21 . The method of Claim 20 wherein the agent is administered to a mammalian animal 
predetermined to have a pathology associated with the p53 pathway. 

20 

22. The method of Claim 20 wherein the agent is a small molecule modulator, a 
nucleic acid modulator, or an antibody. 

23. A method for diagnosing a disease in a patient comprising: 
25 (a) obtaining a biological sample from the patient; 

(b) contacting the sample with a probe for MARK expression; 

(c) comparing results from step (b) with a control; 

(d) determining whether step (c) indicates a likelihood of disease. 



30 24. The method of claim 23 wherein said disease is cancer. 

25. The method according to claim 24, wherein said cancer is a cancer as shown in 
Table 1 as having >25% expression level. 
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<210> 1 

<211> 2946 

<212> DNA 

<213> Homo sapiens 



<400> 1 



tcctggaatt 


gcacgcgctt 


cctgaccacc 


aggctctggc 


cctfcgagaag 


ccagcggggc 


60 


tttgtccctg 


ttgctctcct 


tgccaaaccc 


agtctctctg 


ctagtggtgg 


tttcggttgc 


120 


gacaccgtcc 


aggttcccag 


gcaggaaccg 


ctcggcctgg 


ctgcttagct 


acttttcact 


180 


gaggaggtgg 


tggaaggtgt 


cgcctgctct 


ggcugagtaa 


gggtggctgg 


ctgagccggc 


240 


agcccccgcc 


ctaggcctgg 


ctcttcccgg 


cctctgtact 


ttgccctcgc 


tgcctgacag 


300 


gfctctgctgt 


gggctctgct 


gaatggaagt 


cgctggtagt 


cctfcttccct 


ttctccagtc 


360 


ggcccacctt 


gggacacctt 


gactccaagc 


ccagcagtaa 


gtccaacatg 


attcggggcc 


420 


gcaactcagc 


cacctctgct 


gatgagcagc 


cccacattgg 


aaactaccgg 


ctcctcaaga 


480 


ccattggcaa 


gggtaatttt 


gccaaggtga 


agttggcccg 


acacatcctg 


actgggaaag 


540 


aggtagctgt 


gaagatcatt 


gacaagactc 


aactgaactc 


ctccagcctc 


cagaaactat 


600 


tccgcgaagt 


aagaataatg 


aaggttttga 


atcatcccaa 


catagttaaa 


ttafcttgaag 


660 


fc gat t gaga c 


tgagaaaacg 


ctctaccttg 


tcatggagta 


cgctagtggc 


ggagaggtat 


720 


ttgattacct 


agtggctcat 


ggcaggatga 


aagaaaaaga 


ggctcgagcc 


aaattccgcc 


780 


agatagtgtc 


tgctgtgcag 


tactgtcacc 


agaagtttat 


tgtccataga 


gacttaaagg 


840 


cagaaaacct 


gctcttggat 


gctgatatga 


acatcaagat 


tgcagacttt 


ggcttcagca 


900 


atgaattcac 


ctttgggaac 


aagctggaca 


ccttctgtgg 


cagtccccct 


tatgctgccc 


960 


cagaactctt 


ccagggcaaa 


aaatatgatg 


gacccgaggt 


ggatgtgtgg 


agcctaggag 


1020 


ttatcctcta 


tacactggtc 


agcggatccc 


tgccttttga 


tggacagaac 


ctcaaggagc 


1080 
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tgcgggaacg ggtactgagg gggaaatacc gtattccatt ctacatgtcc acggactgtg 1140 

aaaacctgct fcaagaaattt ctcatcctta atcccagcaa gagaggcact ttagagcaaa 1200 

tcatgaaaga tcgatggatg aatgtgggtc acgaagatga tgaactaaag Gcttacgtgg 1260 

agccactccc tgactacaag gacccccggc ggacagagct gatggtgtcc <=tt;gggttata 1320 

cacgggaaga gatccaggac tcgctggtgg gccagagata caacgaggtg atggccacct 1380 

atctgctcct gggctacaag agctccgagc tggaaggcga caccatcacc ctgaaacccc 1440 

ggccttcagc fcgatctaacc aatagcagcg cccaattccc atcccacaag gtacagcgaa 1500 

gcgtgtcggc caatcccaag cagcggcgct tcagcgacca ggctggtcct gccattccca 1560 

cctctaattc ttactctaag aagactcaga gtaacaacgc agaaaataag cggcctgagg 1620 

aggaccggga gtcagggcgg aaagccagca gcacagccaa ggtgcctgcc agccccctgc 1680 

ccggtctgga gaggaagaag accaccccaa ccccctccac gaacagcgtc ctctccacca 1740 

gcacaaatcg aagcaggaat tccccacttt tggagcgggc cagcctcggc caggcctcca 1800 

tccagaatgg caaagacagc ctaaccatgc cagggtcccg ggcctccacg gjcttctgctt 1860 

ctgccgcagt cfcctgcggcc cggccccgcc agcaccagaa atccatgtcg g/cctccgtgc 1920 

accccaacaa ggcctctggg ctgcccccca cggagagtaa ctgtgaggtg ccgcggccca 1980 

gcacagcccc ccagcgtgtc ectgttgcct ccccatccgc ccacaacatc agcagcagtg 2040 

gtggagcccc agaccgaact aacttccccc ggggtgtgtc cagccgaagc accttccatg 2100 

ctgggcagct ccgacaggtg cgggaccagc agaatttgcc ctacggtgtg accccagcct 2160 

ctccctctgg ccacagccag ggccggcggg gggccfcctgg gagcatcttc agcaagttca 2220 

cctccaagtt tgtacgcagg aacctgaatg aacctgaaag caaagaccga g-tggagacgc 2280 

tcagacctca cgtggtgggc agtggcggca acgacaaaga aaaggaagaa tttcgggagg 2340 

ccaagccccg ctccctccgc ttcacgtgga gtatgaagac cacgagctcc atggagccca 2400 

acgagatgat gcgggagatc cgcaaggtgc tggacgcgaa cagctgccag agcgagctgc 2460 

atgagaagta cafcgctgctg tgcatgcacg gcacgccggg ccacgaggac ttcgtgcagt 2520 

gggagatgga ggtgtgcaaa ctgccgcggc tctctctcaa cggggttcga tttaagcgga 2580 

tatcgggcac ctccatggcc ttcaaaaaca ttgcctccaa aatagccaac gagctgaagc 2640 

tttaacaggc tgccaggagc gggggcggcg ggggcgggcc agctggacgg grctgccggcc 2700 

gtgcgccgcc ccacctgggc gagactgcag cgatggattg gtgtgtctcc ctgctggcac 2760 

ttctcccctc cctggccctt ctcagttfctc tcccacattc acccctgccc agagattccc 2820 

ccttctcctc tcccctactg gaggcaaagg aaggggaggg tggatggggg ggcagggctc 2880 

cccctcggta cfcgcggttgc acagagtatt tcgcctaaac caagaaattt tttattacca 2940 
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aaaaga 2946 

<210> 2 

<211> 2784 

<212> DNA 

<213> Homo sapiens 

<400> 2 

tcctggaatt gcacgcgctt cctgaccacc aggctctggc ccttgagaag ccagcggggc 60 

tttgtccctg ttgctctcct tgccaaaccc agtcfcctctg ctagtggtgg tfcfccggttgc 120 

gacaccgtcc aggttcccag gcaggaaccg ctcggccfcgg ctgcttagct acttttcact 180 

gaggaggtgg tggaaggtgt cgcctgctct ggctgagtaa gggfcggctgg ctgagccggc 240 

agcccccgcc ctaggcctgg ctcttcccgg cctctgtact ttgccctcgc tgcctgacag 300 

gttctgctgt gggctctgct gaatggaagt cgctggfcagt ccttttccct ttctccagtc 360 

ggcccacctt gggacacctt gactccaagc ccagcagtaa gtccaacatg afcfccggggcc 420 

gcaactcagc cacctctgct gatgagcagc cccacattgg aaactaccgg ctcctcaaga 480 

ccattggcaa gggtaatttt gccaaggtga agttggcccg acacafccctg acfcgggaaag 540 

aggtagctgt gaagatcatt gacaagactc aactgaactc ctccagcctc cagaaactat 600 

tccgcgaagt aagaataatg aaggttttga atcafccccaa catagttaaa tfcattfcgaag 660 

tgattgagac tgagaaaacg ctctaccttg tcafcggagta cgctagtggc ggagaggtat 720 

ttgattacct agtggcfccat ggcaggatga aagaaaaaga ggctcgagcc aaattccgcc 780 

agatagtgtc tgctgtgcag tactgtcacc agaagttfcat tgtccataga gacttaaagg 840 

cagaaaacct gctcttggat gctgatatga acatcaagat tgcagacttt ggcfctcagca 900 

atgaattcac ctttgggaac aagctggaca ccttctgtgg cagtccccct tatgctgccc 960 

cagaactctt ccagggcaaa aaatatgatg gacccgaggt ggatgtgtgg agcctaggag 102 0 

ttatcctcta tacactggtc agcggatccc tgccttttga tggacagaac ctcaaggagc 1080 

tgcgggaacg ggtactgagg gggaaatacc gtattccatt ctacatgtcc acggactgtg 1140 

aaaacctgct taagaaattt ctcatcctta atcccagcaa gagaggcact ttagagcaaa 1200 

tcatgaaaga tcgatggatg aatgtgggtc acgaagatga tgaactaaag ccttacgtgg 1260 

agccactccc tgactacaag gacccccggc ggacagagct gafcggtgtcc afcgggttata 1320 

cacgggaaga gatccaggac tcgctggfcgg gccagagata caacgaggtg afcggccacct 1380 

atctgcfccct gggctacaag agctccgagc tggaaggcga caccatcacp ctgaaacccc 1440 

ggccttcagc tgatctaacc aatagcagcg cccaattccc atcccacaag gtacagcgaa 1500 

gcgtgtcggc caatcccaag cagcggcgct tcagcgacca ggcfcggtcct gccattccca 1560 
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cctctaattc ttactctaag aagactcaga gtaacaacgc agaaaataag c:ggcctgagg 1620 

aggaccggga gtcagggcgg aaagccagca gcacagccaa ggtgccfcgcc agccccctgc 1680 

ccggtctgga gaggaagaag accaccccaa ccccctccac gaacagcgtc ctctccacca 1740 

gcacaaatcg aagcaggaat tccccacttt tggagcgggc cagcctcggc caggcctcca 1800 

tccagaatgg caaagacagc acagcccccc agcgtgtccc tgfctgcctcc ccatccgccc 1860 

acaacatcag cagcagtggt ggagccccag accgaactaa cttcccccgg g-gtgfcgtcca 1920 

gccgaagcac cttccatgct gggcagctcc gacaggtgcg ggaccagcag atatttgccct 1980 

acggtgtgac cccagcctct ccctctggcc acagccaggg ccggcggggg Q-cctctggga 2040 

gcatcttcag caagttcacc tccaagtttg tacgcaggaa cctgaatgaa cctgaaagca 2100 

aagaccgagt ggagacgctc agacctcacg tggtgggcag tggcggcaac gacaaagaaa 2160 

aggaagaatt tcgggaggcc aagccccgct ccctccgctt cacgtggagt atgaagacca 2220 

cgagctccat ggagcccaac gagatgatgc gggagatccg caaggtgctg gacgcgaaca 22 80 

gctgccagag cgagctgcat gagaagtaca tgctgctgtg catgcacggc acgccgggcc 2340 

acgaggactt cgtgcagtgg gagatggagg tgtgcaaact gccgcggctc tctctcaacg 2400 

gggfctcgatt taagcggata tcgggcacct ccatggcctt caaaaacatt gcctccaaaa 2460 

tagccaacga gctgaagctt taacaggctg ccaggagcgg gggcggcggg; ggcgggccag 252 0 

ctggacgggc tgccggccgt gcgccgcccc acctgggcga gactgcagcg atggattggt 2580 

gtgtctccct gctggcactt ctcccctccc tggcccttct cagttttctc ccacattcac 2640 

ccctgcccag agattccccc ttctcctctc ccctactgga ggcaaaggaa ggggagggtg 2700 

gatggggggg cagggctccc cctcggtact gcggttgcac agagtafcttc gcctaaacca 2760 

agaaattttt tattaccaaa aaga 2784 

<210> 3 

<211> 3103 

<212> DHA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> (2941) . . (2941) 
<223> »n" is A, C, G, or T 

<400> 3 

cggtggtggc ggccatgttg ggagcagcag gtccggcggc ggcfcgcctgt gtgccgggcg 60 

cggagcagtg ccgctgaggg caggggagga gcgaggcagg cggccggc tg cggcggcaga 120 

gagtaggcgg agcggcgcgg cccggccgaa aggcggcaca gcccagccgg gggtcggggg 180 

ggtgcggtcc ggagccgctc ggagccggcg cggcctagcc cgagcggcgc atccccgggc 240 
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tggcgtgagc ggctgcccgg cctccccgca cccccggccg gggcccatgc ggcgggtgct 300 

cctgctgtga gaagccccgc ccggccgggc tccgcgcctt cccttccctc ccttcctcca 360 

agcttctcgg ttccctcccc cgagataccg gcgccatgtc cagcgctcgg acccccctac 420 

ccacgctgaa cgagagggac acggagcagc ccaccttggg acaccttgac tccaagccca 480 

gcagtaagtc caacatgatt cggggccgca actcagccac ctctgctgat gagcagcccc 540 

acattggaaa ctaccggctc ctcaagacca ttggcaaggg taattttgcc aaggtgaagt 600 

tggcccgaca catcctgact gggaaagagg tagctgtgaa gatcattgac aagactcaac 660 

tgaactcctc cagcctccag aaactattcc gcgaagtaag aataatgaag gttttgaatc 720 

atcccaacat agttaaatta tttgaagtga ttgagactga gaaaacgctc taccttgtca 780 
tggagtacgc tagtggcgga gaggtatttg attacctagt ggctcatggc aggatgaaag 840 
aaaaagaggc tcgagccaaa ttccgccaga tagtgtctgc tgtgcagtac tgtcaccaga 900 
agtttattgt ccatagagac ttaaaggcag aaaacctgct cttggatgct gatatgaaca 9 60 

tcaagattgc agactttggc ttcagcaatg aattcacctt tgggaacaag ctggacacct 1020 

tctgtggcag tcccccttat gctgccccag aactcttcca gggcaaaaaa tatgatggac 1080 

ccgaggtgga tgtgtggagc ctaggagtta tcctctatac actggtcagc ggatccctgc 1140 

cttttgatgg acagaacctc aaggagctgc gggaacgggt actgagggga aaataccgta 1200 

ttccattcta catgiccacg gactgtgaaa acctgcttaa gaaatttctc attcttaatc 1260 

ccagcaagag aggcacttta gagcaaatca tgaaagatcg atggatgaat gtgggtcacg 1320 

aagatgatga actaaagcct tacgtggagc cactccctga ctacaaggac ccccggcgga 1380 

cagagctgat ggtgtccatg ggttatacac gggaagagat ccaggacfccg ctggtgggcc 1440 

agagatacaa cgaggtgatg gccacctatc tgctcctggg ctacaagagc tccgagctgg 1500 

aaggcgacac catcaccctg aaaccccggc cttcagctga tctgaccaat agcagcgccc 1560 

catccccatc ccacaaggta cagcgcagcg tgtcggccaa tcccaagcag cggcgcttca 1620 

gcgaccaggc tggtcctgcc attcccacct ctaattctta ctctaagaag actcagagta 1680 

acaacgcaga aaataagcgg cctgaggagg accgggagtc agggcggaaa gccagcagca 1740 

cagccaaggt gcctgccagc cccctgcccg gtctggagag gaagaagacc accccaaccc 18 00 

cctccacgga acagcgtcct ctccaccagc acaaatcgaa gcaggaattc cccacttttg 18 60 

gagcgggcca gcctcggcca ggcctccatc cagaatggca aagacagcct aaccatgcca 1920 

gggtcccggg cctccacggc ttctgcttct gccgcagtct ctgcggcccg gccccgccag 1980 

caccagaaat ccatgtcggc ctccgtgcac cccaacaagg ccfcctgggct gccccccacg 2040 

gagagtaact gtgaggtgcc gcggcccagc acagcccccc agcgtgtccc tgttgcctcc 2100 
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2460 




tccy"cttcac 


crfc crcracrfca tcf 


aagaccacga 


gctccatgga 


cr cc g aac cracr 


2520 


aLyauyuy y y 


_ 4- cccrc aa 

CXy d. UL» L» 53 L» 


crcrfccrcfccrcrac 


gegaacaget 


gecagagega 


cr c fc ac at cr an 


2580 


= ra rrfc apat" rf r"« 
day LaLaLy u 


f~ err* fc crfc crc a t 

uy u uy uy ' — a. l 


gcacggcacg 


cccrcjgccacg 


aggac fc fc c g fc 


ctg acrfcer crcra cr 


2640 




rrr* aaani* rrr* G 
y Lctaau uy uu 


rTf* CTCTGfcC feet 

y l y y v-» i— i— 


c fc c aa c cjggg* 


ttcgatttaa 


ere crcrafcs t* r* rr 

£3 v_-vjycj.UCS.UUy 


2700 


y y uclu L LLLa 


f" rrrTr t r i t" fc na 
uy y uu u uuglgl 


a aanattcrcc 


tccaaaatag 


ccaacgagct 


ctaacrn fc t* fc a a 


2760 


Lay y u t_y uuct 


yy»y i -yyyyy 


G crcrc rrrrcrCTCrcr 
*-yy u yyyayy 


ccrcTCTC c acre fc 


crcraG crcrcr c fc cr 


G c crorc g ctg fc rr 
— 1 — yy '-'^y ^ uy 


2820 


r* rrr* r* err* r* G G a 
uyuuyuuuucx 


c c t cr cr crc cracr 
uu L.yyy-y^-y 


acfcgcagcga 


fccrcrafc fccrcrfc cr 


tgtctcccct 


crctCTGcacfct 


2880 


ctcccctccc 


tggcccttct 


cagttttctc 


fctacafcg-ttfc 


gtgggrjrjgtg 


crgagafcfcgtfc 


2940 


ntccagcacc 


ccacattcac 


ccctgcccag 


agattccccc 


ttctcctctc 


ccctactgga 


3000 


ggcaaaggaa 


ggggagggtg 


gatggggggg 


cag-g-gcfcccc 


cctcggtact 


gcggfctgcac 


3060 


agagtattfcc 


gcctaaacca 


agaaattttt 


tattaccaaa 


aag 




3103 



<210> 4 

<211> 2086 

<212> DNA 

<213> Homo sapiens 

<400> 4 

agtccaacat gattegggge cgcaactcag ccacctctgc fcgatgagcag ccccacattg 60 

gaaactaccg gctcctcaag accattggca agcjgfcaattfc tgccaaggtg aagfctggccc 12 0 

gacacatCGt gactgggaaa gaggfcagcfcg tgaagatcat tgacaagact caactgaact 180 

cctccagcct ecagaaaefca fctccgcgaag taagaataat gaaggtttfcg aatcatccca 240 

acatagttaa attatfcfcgaa gtgafctgaga ctcjagaaaac gctctacctt cjtcatggagt 300 

acgctagfcgg cggagaggfca tttgatfcacc tagrtgejetea tggcaggafcg aaagaaaaag 360 

aggefcegage caaafctccgc cagatagtgt ctcjctg-fcgca gtactgfccac cagaagfctta 420 

ttgtccatag agacttaaag gcagaaaacc tgctctfcgga fcgcfcgatatg aacatcaaga 480 

ttgeagaett tggcttcagc aatgaafctca ccfcttcjg-gaa caagctggac accttcfcgtg 540 

gcagtccccc tfcafcgctgcc ccagaactct tccaggcjcaa aaaafcatgat ggacccgagg 600 
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fcggatgtgtg gagcctagga gttatcctct 
atggacagaa cctcaaggag ctgcgggaac 
fcctacatgtc cacggactgt gaaaacctgc 
agagaggcac tttagagcaa atcatgaaag 
atgaactaaa gccttacgtg gagccactcc 
tgatggtgtc catgggttat acacgggaag 
acaacgaggt gatggccacc tatctgctcc 
acaccatcac cctgaaaccc cggccttcag 
catcccacaa ggtacagcgc agcgtgtcgg 
aggctggtcc tgccattccc acctctaatt 
cagaaaataa gcggcctgag gaggaccggg 
aggtgcctgc cagccccctg cccggtctgg 
cgaacagcgt cctctccacc agcacaaatc 
ccagcctcgg ccaggcctcc atccagaatg 
ctgttgcctc cccatccgcc cacaacatca 
acttcccccg gggtgtgtcc agccgaagca 
gggaccagca gaatttgccc tacggtgtga 
gccggcgggg ggcctctggg agcatcttca 
acctgaatga acctgaaagc aaagaccgag 
gtggcggcaa cgacaaagaa aaggaagaat 
fccacgtggag tatgaagacc acgagctcca 
gcaaggtgct ggacgcgaac agctgccaga 
gcatgcacgg cacgccgggc cacgaggact 
fcgccgcggct ctctctcaac ggggttcgat 
tcaaaaacat tgcctccaaa atagccaacg 

<210> 5 

<211> 2248 

<212> DHA 

<213> Homo sapiens 

<400> 5 

agtccaacat gattcggggc cgcaactcag 
gaaactaccg gctcctcaag accattggca 
gacacatcct gactgggaaa gaggtagctg 



PCT/US02/17461 

atacactggt cagcggatcc ctgccttttg 660 

gggtactgag gggaaaatac cgtattccat 720 

ttaagaaatt tctcattctt aatcccagca 780 

atcgatggat gaafcgtgggt cacgaagatg 840 

ctgactacaa ggacccccgg cggacagagc 900 

agatccagga ctcgctggtg ggccagagat 960 

tgggctacaa gagctccgag ctggaaggcg 102 0 

ctgatctgac caatagcagc gccccatccc 1080 

ccaatcccaa gcagcggcgc ttcagcgacc 1140 

cttactctaa gaagactcag agtaacaacg 1200 

agtcagggcg gaaagccagc agcacagcca 1260 

agaggaagaa gaccacccca accccctcca 132 0 

gaagcaggaa ttccccactt ttggagcggg 1380 

gcaaagacag cacagccccc cagcgtgtcc 1440 

gcagcagtgg tggagcccca gaccgaacta 1500 

ccttccatgc tgggcagctc cgacaggtgc 1560 

ccccagcctc tccctctggc cacagccagg 162 0 

gcaagttcac ctccaagfcfcfc gtacgcagga 1680 

tggagacgct cagacctcac gtggfcgggca 1740 

ttcgggaggc caagccccgc tccctccgct 1800 

tggagcccaa cgagatgatg cgggagatcc 1860 

gcgagctgca tgagaagtac ' atgctgctgt 1920 

tcgtgcagtg ggagatggag gtgtgcaaac 1980 

ttaagcggat atcgggcacc tccatggcct 2040 

agctgaagct ttaaca 2086 



ccacctctgc tgatgagcag ccccacattg 60 
agggtaattt tgccaaggtg aagttggccc 120 
tgaagafccat tgacaagact caactgaact- 180 



icsrwMrv --\Ain 
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cctccagcct ccagaaacta ttccgcgaag taagaataat gaaggttttg aatcatccca 240 

acatagttaa attatttgaa gtgatfcgaga* ctgagaaaac gctctacctt gtcatggagfc 300 

acgctagtgg cggagaggta tttgattacc tagtggctca tggcaggatg aaagaaaaag 3 60 

aggctcgagc caaattccgc cagatagtgt ctgctgtgca gtactgtcac cagaagttta 420 

ttgtccatag agacttaaag gcagaaaacc tgctcttgga tgctgatatg aacatcaaga 480 

ttgcagactfc tggcttcagc aatgaattca cctttgggaa caagctggac accttctgtg 540 

gcagtccccc ttatgctgcc ccagaactct tccagggcaa aaaatatgat ggacccgagg 600 

tggatgtgtg gagcctagga gttatcctct atacactggt cagcggatcc ctgccttttg 660 

atggacagaa cctcaaggag ctgcgggaac gggtactgag gggaaaatac cgtattccat 720 

tctacatgtc cacggactgt gaaaacctgc ttaagaaatt tctcattctt aatcccagca 780 

agagaggcac tttagagcaa atcatgaaag atcgatggat gaatgtgggt cacgaagatg 840 

atgaactaaa gccttacgtg gagccactcc ctgactacaa ggacccccgg cggacagagc 900 

tgatggtgtc catgggttat acacgggaag agatccagga ctcgctggtg ggccagagat 960 

acaacgaggt gatggccacc tatctgctcc tgggctacaa gagctccgag ctggaaggcg 102 0 

acaccatcac cctgaaaccc cggccttcag ctgatctgac caatagcagc gccccatccc 1080 

catcccacaa ggtacagcgc agcgtgtcgg ccaatcccaa gcagcggcgc ttcagcgacc 1140 

aggctggtcc tgccattccc acctctaatt cttactctaa gaagactcag agtaacaacg 1200 

cagaaaataa gcggcctgag gaggaccggg agtcagggcg gaaagccagc agcacagcca 1260 

aggtgcctgc cagccccctg cccggtctgg agaggaagaa gaccacccca accccctcca 1320 

cgaacagcgt cctctccacc agcacaaatc gaagcaggaa ttccccactt ttggagcggg 1380 

ccagcctcgg ccaggcctcc atccagaatg gcaaagacag cctaaccatg ccagggtccc 1440 

gggcctccac ggcttctgct tctgccgcag tctctgcggc ccggccccgc cagcaccaga 1500 

aatccatgtc ggcctccgtg caccccaaca aggcctctgg gctgcccccc acggagagta 1560 

acfcgtgaggt g C cgcggccc agcacagccc cccagcgtgt ccctgttgcc tccccatccg 1620 

cccacaacat cagcagcagt ggtggagccc cagaccgaac taacttcccc cggggtgtgt 1680 

ccagccgaag caccttccat gctgggcagc tccgacaggt gcgggaccag cagaatttgc 1740 

cctacggtgt gaccccagcc tctccctctg gccacagcca gggccggcgg ggggcctctg 1800 

ggagcatctt cagcaagttc acctccaagt ttgtacgcag gaacctgaat gaacctgaaa 1860 

gcaaagaccg agtggagacg ctcagacctc acgtggtggg cagtggcggc aacgacaaag 1920 

aaaaggaaga atttcgggag gccaagcccc gctccctccg cttcacgtgg agtatgaaga 1980 

ccacgagctc catggagccc aacgagatga tgcgggagat ccgcaaggtg ctggacgcga 2040 
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acagctgcca gagcgagctg catgagaagt acafcgctgct gtgcatgcac ggcacgccgg 2100 

gccacgagga cttcgtgcag tgggagafcgg aggfcgtgcaa actgccgcgg ctctctctca 2160 

a-cggggfctcg atttaagcgg atatcgggca cctccatggc cttcaaaaac attgcctcca 2220 

aaatagccaa cgagctgaag ctttaaca 2248 

<210> 6 

<211> 2701 

<212> DMA 

<213> Homo sapiens 

<400> 6 

ggcacgaggg ctgaacgaga gggacacgga gcagcccacc ttgggacacc ttgactccaa 60 

gcccagcagt aagtccaaca tgattcgggg ccgcaactca gccacctctg ctgafcgagca 120 

gccccacatt ggaaactacc ggctcctcaa gaccattggc aagggtaatt ttgccaaggt 180 

gaagttggcc cgacacatcc tgactgggaa agaggtagct gtgaagatca ttgacaagac 240 

tcaactgaac tcctccagcc tccagaaact attccgcgaa gtaagaataa tgaaggtttt 300 

gaatcatccc aacatagtta aattatttga agtgattgag actgagaaaa cgctctacct 3 60 

tgtcafcggag tacgctagtg gcggagaggt attfcgatfcac ctagtggctc atggcaggat 420 

gaaagaaaaa gaggctcgag ccaaafctccg ccagatagtg tctgctgtgc agtactgtca 480 

ccagaagttt attgtccata gagacttaaa ggcagaaaac ctgctcttgg atgcfcgatat 540 

gaacatcaag attgcagact ttggcttcag caatgaattc acctfctggga acaagctgga 600 

caccttctgt ggcagtcccc cttatgctgc cccagaactc fctccagggca aaaaatatga 660 

tggacccgag gtggatgtgt ggagcctagg agttatcctc fcatacactgg tcagcggatc 720 

cctgcctttt gatggacaga acctcaagga gctgcgggaa cgggtactga ggggaaaata 780 

ccgtattcca ttctacatgt ccacggactg tgaaaacctg cttaagaaat tfcctcatfcct. 840 

taatcccagc aagagaggca ctttagagca aatcatgaaa gatcgatgga tgaatgtggg 900 

tcacgaagat gatgaactaa agccttacgt ggagccactc cctgactaca aggacccccg 960 

gcggacagag ctgatggtgt ccatgggtta tacacgggaa gagatccagg actcgctggt: 1020 

gggccagaga fcacaacgagg tgatggccac ctafcctgctc ctgggctaca agagctccga 1080 

gctggaaggc gacaccatca ccctgaaacc ccggccttca gctgatctga ccaatagcag 1140 

cgccccatcc ccatcccaca aggtacagcg cagcgtgtcg gccaatccca agcagcggcg 1200 

cttcagcgac caggcagctg gtcctgccat tcccacctct aattcttact ctaagaagac 1260 

tcagagtaac aacgcagaaa ataagcggcc tgaggaggac cgggagtcag ggcggaaagc 1320 

cagcagcaca gccaaggtgc ctgccagccc cctgcccggt ctggagagga agaagaccac 1380 
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cccaaccccc tccacgaaca gcgfccctctc caccagcaca 


aatcgaagca 


ggaattcccc 


1440 


acttttggag cgggccagcc tcggtcaggc ctccatccag 


aatggcaaag 


acagcctaac 


1500 


catgccaggg tcccgggcct ccacggctfcc tgcttcfcgcc 


gcagtctctg 


cggcccggcc 


1560 


ccgccagcac cagaaatcca tgtcggcctc cgtgcacccc 


aacaaggcct 


cfcgggctgcc 


1620 


ccccacggag agtaactgtg aggtgccgcg gcccagcaca 


gccccccagc 


g-fcgtccctgt 


1680 


tgcctcccca tccgcccaca acatcagcag cagtggtigga 


gccccagacc 


gaacfcaacfcfc 


1740 


cccccggggt gtgtccagcc gaagcacctt ccatgcfcggg 


cagctccgac 


aggtgcggga 


1800 


ccagcagaat ttgccctacg gtgtgacccc agcctctccc 


tctggccaca 


gccagggccg 


1860 


gcggggggcc tctgggagca tcttcagcaa gttcacctcc 


aagtttgtac 


gcagaaatct 


1920 


gtctttcagg tttgccagaa ggaacctgaa tgaacctgaa 


agcaaagacc 


g-agtggagac 


1980 


gctcagacct cacgtggtgg gcagtggcgg caacgacaaa 


gaaaaggaag 


aatttcggga 


2040 


ggccaagccc' cgc.tccctcc gcttcacgtg gagtafcgaag 


accacgagct 


ccatggagcc 


2100 


caacgagatg atgcgggaga tccgcaaggt gctggacgcg 


aacagctgcc 


agagcgagct 


2160 


gcatgagaag fcacatgctgc tgtgcatgca cggcacgccg 


ggccacgagg 


acttcgtgca 


2220 


gtgggagatg gaggtgtgca aactgccgcg gctctctctc 


aacggggttc 


gatttaagcg 


2280 


gatatcgggc acctccatgg ccttcaaaaa cattgcctcc 


aaaatagcca 


acgagctgaa 


2340 


gctttaacag gctgccagga gcgggggcgg cgggggcggg 


ccagctggac 


gggctgccgg 


2400 


ccgctgcgcc gccccacctg ggcgagactg cagcgafcgga 


ttggtgtgtc 


tcccctgctg 


2460 


gcacttctcc cctccctggc ccttctcagt tttctcttac 


atgtttgtgg 


g-gggtgggag 


2520 


attgttctcc agccccccac attcacccct gcccagagat 


tcccccttct 


cctctcccct 


2580 


actggaggca aaggaagggg agggtggatg ggggggcagg 


gctccccctc 


ggtactgcgg 


2640 


ttgcacagag tatttcgcct aaaccaagaa afctttfctafct 


accaaaaaaa 


aaaaaaaaaa 


2700 








2701 



<210> 7 

<211> 2112 

<212> DNA 

<213> Homo sapiens 

<400> 7 

cccagcagta agtccaacat gattcggggc cgcaactcag ccacctctgc tgatgagcag 
ccccacattg gaaactaccg gctcctcaag accattggca agggtaattt tgccaaggtg 
aagttggccc gacacatcct gactgggaaa gaggtagcfcg tgaagatcat tgacaagact 
caactgaact cctccagcct ccagaaacta ttccgcgaag taagaataat gaaggttttg 
aatcatccca acatagttaa atfcatttgaa gtgattgaga ctgagaaaac gctctacctt 
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gtcatggagt 


acgctagtgg 


cggagaggta 


tttgattacc 


tagtggctca tggcaggatg 


360 


aaagaaaaag 


aggctcgagc 


caaattccgc 


cagafcagtgt 


ctgctgtgca gtactgtcac 


420 


cagaagttta 


ttgtccatag 


agacttaaag 


gcagaaaacc 


tgctcttgga 


tgctgatatg 


480 


aacatcaaga 


ttgcagactt 


tggcttcagc 


aatgaattca 


cctttgggaa 


caagctggac 


540 


accttctgtg 


gcagtccccc 


ttatgctgcc 


ccagaactct 


tccagggcaa 


aaaatatgat 


600 


ggacccgagg 


tggatgtgtg gagcctagga 


gttatcctct 


atacactggt 


cagcggatcc 


660 


ctgccttttg 


atggacagaa 


cctcaaggag 


ctgcgggaac 


gggtactgag 


gggaaaatac 


720 


cgtattccat 


tctacatgtc 


cacggactgt 


gaaaacctgc 


fctaagaaatt 


tctcattctt 


780 


aatcccagca 


agagaggcac 


tttagagcaa 


atcatgaaag 


atcgatggat 


gaatgtgggt 


840 


cacgaagatg 


atgaactaaa 


gccttacgtg gagccactcc 


ctgactacaa 


ggacccccgg 


900 


cggacagagc 


tgatggtgtc 


catgggttat 


acacgggaag 


agate cagga 


ctcgctggtg 


960 


ggccagagat 


acaac'gaggt 


gatggccacc 


tatcfcgctcc 


tgggctacaa 


gagctccgag 


1020 


ctggaaggcg 


acaccatcac 


cctgaaaccc 


cggccttcag 


ctgatctgac 


caatagcagc 


1080 


gccccatccc 


catcccacaa 


ggtacagcgc 


agcgtgtcgg 


ccaatcccaa 


gcagcggcgc 


1140 


ttcagcgacc 


aggctggtcc 


tgccattccc 


acctctaatt 


cttactctaa 


gaagactcag 


1200 


agtaacaacg 


cagaaaafcaa 


gcggcctgag gaggaccggg agtcagggcg gaaagccagc 


1260 


agcacagcca 


aggtgcctgc 


cagccccctg cccggtctgg 


agaggaagaa 


gaccacccca 


1320 


accccctcca 


cgaacagcgt 


cctctccacc 


agcacaaatc 


gaagcaggaa 


ttccccactt 


1380 


ttggagcggg 


ccagcctcgg 


ccaggcctcc 


atccagaatg 


gcaaagacag 


cacagccccc 


1440 


cagcgtgtcc 


ctgttgcctc 


cccatccgcc 


cacaacatca 


gcagcagtgg 


tggagcccca 


1500 


gaccgaacta 


acttcccccg 


gggtgtgtcc 


agccgaagca 


ccttcca.tgc 


tgggcagctc 


1560 


cgacaggtgc 


gggaccagca 


gaatfctgccc 


fcacggtgfcga 


ccccagcctc 


tccctctggc 


1620 


cacagccagg 


gccggcgggg 


ggcctctggg 


agcatcttca 


gcaagttcac 


ctccaagttt 


1680 


gtacgcagga 


acctgaatga 


acctgaaagc 


aaagaccgag 


tggagacget 


cagacctcac 


1740 


gtggtgggca 


gfcggcggcaa 


cgacaaagaa 


aaggaagaat 


ttegggagge 


caagccccgc 


1800 


tccctccgct 


tcacgtggag 


tatgaagacc 


acgagctcca 


tggagcccaa 


cgagatgatg 


1860 


cgggagatcc 


gcaaggtgct 


ggacgcgaac 


agcfcgccaga 


gegagctgea 


tgagaagtac 


1920 


atgcfcgctgt 


gcatgcacgg 


cacgccgggc 


cacgaggact 


fccgtgcagtg 


ggagatggag 


1980 


gtgtgcaaac 


fcgccgcggct: 


ctctctcaac 


ggggttcgat 


ttaageggat 


atcgggcacc 


2040 


tccatggcct 


tcaaaaacat 


tgcctccaaa 


atagccaacg 


agefcgaaget 


ttaacaggct 


2100 


gccaggagcg 


gg 










2112 
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<210> 8 

<211> 2965 

<212> DNA 

<213> Homo sapiens 

<400> 8 

cgggcaaccg cctcgcccga agccctccct cgfctactgtc cgcatacccc g^cggcgccg 60 

ccgcggaaag cggctccccc tcctcttact ccgcgtcctc ttccctcttt cccccgccgg 120 

ggcacgcttg ttgcaccgfcc ccgcggcctg cgggagccgc tcgccccgga cttgagctcg 180 

cgtacgaccc afcfcfccctgtc gccccccgga gcccgcacca cagcccggcc ggtctagacc 240 

ccggcagacc ccgctggccg cacaaaatgt cggcccggac gccattgccg acggtgaacg 300 

agcgggacac ggtaaatcat acgactgtgg atggatatac tgaaccacac atccagccta 360 

ccaagtcgag tagcagacag aacatccccc ggtgtagaaa ctccattacg tcagcaacag 420 

atgaacagcc tcacattgga aattaccgtt tacaaaaaac aatagggaag ggaaattttg 480 

ccaaagtcaa attggcaaga cacgttctaa cfcggtagaga ggfctgctgtg aaaataatag 540 

acaaaactca gctaaatcct accagtctac aaaagttatt tcgagaagta cgaataatga 600 

agatactgaa tcatcctaat atagtaaaat tgtttgaagt tattgaaaca gagaagactc 660 

tctatttagt catggaatac gcgagtgggg gtgaagtatt tgattactta gfctgcccatg 720 

gaagaatgaa agagaaagag gcccgtgcaa aatttaggca gattgtatct gctgtacagt 780 

attgtcatca aaagtacatt gttcaccgtg afccttaaggc tgaaaacctt ctccttgatg 840 

gtgatatgaa fcattaaaatt gctgactttg gfctttagtaa tgaatttaca gttgggaaca 900 

aattggacac attttgtgga agcccaccct afcgctgctcc cgagcttttc caaggaaaga 960 

agtafcgatgg gcctgaagtg gatgtgtgga gfcctgggcgt cattctctat acattagtca 1020 

gtggctcctt gcctttcgat ggccagaatt taaaggaact gcgagagcga gfctttacgag 1080 

ggaagfcaccg tattcccttc tatatgtcca cagactgtga aaatcttctg aagaaattat 1140 

tagtcctgaa tccaataaag agaggcagct tggaacaaat aatgaaagat cgatggatga 1200 

atgttggtca tgaagaggaa gaactaaagc catatactga gcctgatccg gatttcaatg 12 60 

acacaaaaag aatagacatt atggtcacca tgggctttgc acgagatgaa ataaatgatg 1320 

ccttaataaa tcagaagtat gatgaagtta tggcfcactta tattcttcta ggtagaaaac 1380 

cacctgaatt tgaaggtggt gaatcgttat ccagtggaaa cttgtgtcag aggtcccggc 1440 

ccagtagtga cttaaacaac agcactcttc agtcccctgc tcacctgaag gtccagagaa 1500 

gtatctcagc aaatcagaag cagcggcgtt tcagfcgatca tgctggtcca tccattcctc 1560 

ctgctgtafcc atataccaaa agacctcagg cfcaacagtgt ggaaagtgaa cagaaagagg 1620 
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agtgggacaa agatgtggct cgaaaacttg gcagcacaac agttggatca aaaagcgaga 
tgacfcgcaag ccctcttgta gggccagaga ggaaaaaatc ttcaactafct ccaagtaaca 
atgtgfcattc tggaggtagc atggcaagaa ggaatacata tgtctgtgaa aggaccacag 
atcgatacgt agcattgcag aatggaaaag acagcagcct tacggagatg tctgtgagta 
gcatatcttc tgcaggctct tctgtggcct ctgctgtccc ctcagcacga ccccgccacc 
agaagtccat gtccacttct ggtcatccta ttaaagtcac actgccaacc attaaagacg 
gctctgaagc ttaccggcct ggtacaaccc agagagtgcc tgctgcttcc ccatctgctc 
acag tafctag tactgcgact ccagaccgga cccgttttcc ccgagggagc tcaagccgaa 
gcacttticca tggtgaacag ctccgggagc gacgcagcgt tgcttataat gggccacctg 
cttcaccatc ccatgaaacg ggtgcatttg cacatgccag aaggggaacg tcaactggta 
taataagcaa aatcacatcc aaatttgttc gcagggatcc aagtgaaggc gaagccagtg 
gcagaaccga cacctcaaga agtacatcag gggaaccaaa agaaagagac aaggaagagg 
gtaaagattc taagccgcgt tctttgcggt tcacatggag tatgaagacc actagttcaa 
tggaccctaa tgacatgatg agagaaatcc gaaaagtgtt agatgcaaat aactgtgatt 
atgagcaaaa agagagattt ttgcttttct gtgtccatgg agacgctaga caggatagcc 
tcgtgcagtg ggagafcggaa gtctgcaagt tgccacgact gtcacttaat ggggttcgct 
tcaagcgaat atctgggaca tctattgcct ttaagaacat tgcatcaaaa atagcaaatg 
agcttaagct gtaaagaagt ccaaatttac aggttcaggg aagatacata catatatgag 
gtacagtttt tgaatgtact ggtaatgcct aatgtggtct gcctgtgaafc ctccccatgt 
agaatfctgcc cttaatgcaa taaggttata catagttatg aactgtaaaa ttaaagtcag 
tatgaactat aataaatatc tgtagcttaa aaagtaggtt cacatgtaca ggtaagtata 
ttgtgtattt ctgttcattt tctgttcata gagttgtata ataaaacatg attgcttaaa 
aacttgaaaa aaaaaaaaaa aaaaa 



1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

2965 



<210> 9 

<211> 3210 

<212> DNA 

<213> Homo sapiens 

<400> 9 

ggcgcggcgg cggcggtggc tgtgaccgcg cggaccgagc cgagacattc gcgccggggg 
atcgggcgcc gccgccgctg ggccccgggc gcgtggatgc ggctgggtcg ggcggcgccg 
tacacctgag gcggagaacg gggcgcggcg cgggtgacgc tgfccagggcc gcggttcctg 
acgcccaggc gctcgccagg acgagccagg cagtgatttg aggcaccggc ttcaccttca 
cccatggtcc ggagagccta gcggggctcg ccaccgcctc ccggctcccc ttccacgcct 
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catcctgcca 


gcctcgccgc 


cccgccagcg 


ccgggcaacc 


yCCUCyCCCg 


aagccctccc 


3 60 


tcgtfcactgt 


ccgcataccc 


cggcggcgcc 


gc c gcgggaa 


gcggctcccc 


ctcctcttcc 


42 0 


tccgcgtcct 


cttccctctt 


tcccccgccg 


gggccgcttg 


ttgcaccgcc 


ccgcggcctg 


yi on 

480 


cgggagccgc 


tcgccccggc 


cttgfcgctcg 


cgtccgcacc 


ccfcttcctgt 


cgccccccgg 


540 


ggcccgcacc 


acagcccggc 


cggcgagacc 


ccggccagac 


cccgctgccc 


gcacaaaatg 


600 


tcggcccgga 


cgccattgcc 


gacggtgaac 


gagcgggaca 


cggaaaatca 


ta.catctgtg 


660 


gatggatata 


ctgaaccaca 


catccagcct 


accaagtcga 


gtagcagaca 


gaacatcccc 


720 


cggtgtagaa 


actccattac 


gtcagcaaca 


gatgaacagc 


ctcacattgg 


aaattaccgt 


780 


ttacaaaaaa 


caatagggaa 


gggaaatttt 


gccaaagtca 


aax,tggcaag 


acaegttcta 


840 


actggtagag aggttgctgt 


gaaaataata 


gacaaaactc 


agct_.aaat_.cc 


taccagtcta 


900 


caaaagttat 


ttcgagaagt 


acgaataatg aagatactga 


atcatccuaa 


t a t aggfc gaa 


y do 


gtatttgatt 


acttagttgc 


ccatggaaga 


atgaaagaga 


aagaggc ccg 


tgcaaaattt 


1020 


aggcagattg 


tatctgctgt 


acagtattgt 


catcaaaagt 


acattgttca 


ccgtgatctt 


1080 


aagctgaaaa 


ccttctcctt 


gatggtgata 


tgaatattaa 


aattgetgae 


ttfcggfcfctta 


1140 


gtaatgaatt 


tacagttggg 


aacaaattgg 


acacattttg 


tggaagecca 


ccctafcgctg 


1200 


ctcccgagct 


tttccaagga 


aagaagtatg 


atgggcctga 


agfcggafc gtg 


tg-gagtctgg 


12 60 


gcgtcattct 


ctatacatta 


gtcagtggct 


ccfctgccttt 


cgatggccag 


aatttaaagg 


1320 


aactgcgaga gcgagtttta 


cgagggaagt 


accgtattcc 


cttctatatg 


tccacagact 


1380 


gtgaaaatct 


tctgaagaaa 


ttattagtcc 


tgaatccaat 


aaagagaggc 


agctfcggaac 


1440 


aaafcaatgaa 


agatcgatgg 


atgaatgttg gtcatgaaga 


ggaagaacta 


aagecatata 


1500 


ctgagcctga 


tccggatttc 


aatgacacaa 


aaagaataga 




accatgggct 


T r r a 


ttgcacgaga 


tgaaataaat 


gatgccttaa taaatcagaa 


/-r4- si 4*" rx rra -§■* <~rra _a 
y LaLyaLyaa 


gctatggcta 


1620 


cttatattct 


tctaggtaga 


aaaccacctg 


aatttgaagg 


v-ggx-gaaccg 


fcfcafcccagfcg 


1680 


gaaacttgtg 


tcagaggtcc 


cggcccagta 


gtgacttaaa 


/~» si /~t -a rrr* 2a /■*» +~ 
CJ clcLC d.y L- dCJ l_ 


cttcagtccc 


•*! *~7 /i n 
J- 740 


ctgctcacct 


gaaggtccag 


agaagtatct 


cagcaaatca 


gaagcagegg 


cgtfcfccagfcg 


1800 


atcafcgctgg 


tccatccatt 


cctcctgctg tatcatatac 


caaaagacct 


caggctaaca 


1860 


gtgtggaaag 


tgaacagaaa 


gaggagtggg 


acaaagatgt 


ggc ticgaaaa 


cttggcagca 


1920 


caacagttgg 


afccaaaaagc 


gagatgactg 


caagccctct 


tgtagggeca 


gagaggaaaa 


1980 


aatcttcaac 


tattccaagt 


aacaatgtgt 


atfcctggagg 


tagcatggca 


agaaggaata 


2040 


catatgtctg 


tgaaaggacc 


acagatcgat 


acgtagcatt 


gcagaatgga 


aaagacagca 


2100 


gccttacgga 


gatgtctgtg 


agtagcatat 


ctfcctgcagg 


ctcttctgtg 


gcctcfcgctg 


2160 
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tcccctcagc 
tcacactgcc 
tgccfcgctgc 
ttccccgagg 
gcgttgctta 
ccagaagggg 
atccaagtga 
caaaagaaag 
ggagtatgaa 
tgttagatgc 
atggagacgc 
gactgtcact 
acattgcatc 
agggaagata 
gtctgcctgt 
tatgaacfcgt 
ggttcacafcg 
tataataaaa 



acgaccccgc 

aaccattaaa 

ttccccatct 

gagctcaagc 

taatgggcca 

aacgtcaact 

aggcgaagcc 

agacaaggaa 

gaccactagt 

aaataactgt 

tagacaggat 

taatggggtt 

aaaaatagca 

catacatata 

gaatctcccc 

aaaattaaag 

tacaggtaag 

catgattgct 



caccagaagt 

gacggctctg 

gctcacagta 

cgaagcactt 

cctgcttcac 

ggtataataa 

agtggcagaa 

gagggtaaag 

tcaatggacc 

gafctatgagc 

agcctcgtgc 

cgcttcaagc 

aatgagctta 

tgaggfcacag 

atgtagaatt 

tcagtatgaa 

tatattgtgt 

taaaaacttg 



ccatgtccac 

aagcttaccg 

ttagtactgc 

tccatggfcga 

catcccatga 

gcaaaatcac 

ccgacacctc 

attctaagcc 

ctaatgacat 

aaaaagagag 

agtgggagat 

gaatatcfcgg 

agcbgtaaag 

tttttgaatg 

tgcccttaat 

ctataataaa 

attfcctgttc 



ttctggfccat 

gcctggfcaca 

gactccagac 

acagctccgg 

aacgggtgca 

atccaaattt 

aagaagtaca 

gcgttctttg 

gatgagagaa 

atfcfcttgctt 

ggaagtctgc 

gacafccfcatt 

aagtccaaat 

fcactggfcaat 

gcaataaggt 

tatctgtagc 

afctttctgtt 



cctattaaag 

acccagagag 

cggacccgtt 

gagcgacgca 

tfcfcgcacatg 

gttcgcaggg 

tcaggggaac 

cggttcacafc 

atccgaaaag 

ttcfcgtgtcc 

aagttgccac 

gcctttaaga 

fcfcacaggttc 

gcctaatgtg 

tafcacatagt 

ttaaaaagta 

cafcagagttg 



2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3210 



<210> 10 

<211> 2505 

<212> DNA 

<213> Homo sapiens 

<400> 10 

gctggccgca caaaatgtcg gcccggacgc cattgccgac 
aaaatcatac atctgfcggat ggafcatactg aaccacacat 
gcagacagaa catcccccgg tgtagaaact ccattacgtc 
acattggaaa ttaccgttta caaaaaacaa tagggaaggg 
tggcaagaca cgttctaact ggtagagagg ttgctgfcgaa 
taaatcctac cagtctacaa aagfctatttc gagaagtacg 
atcctaatat agtaaaattg tttgaagtta ttgaaacaga 
tggaatacgc gagtgggggt gaagtatttg attacttagt 
agaaagaggc ccgtgcaaaa tttaggcaga ttgtatctgc 



ggtgaacgag 
ccagcctacc 
agcaacagat 
aaattfcfcgcc 
aataatagac 
aataatgaag 
gaagactctc 
tgcccatgga 
tgtacagtat 



cgggacacgg 
aagtcgagta 
gaacagcctc 
aaagtcaaat 
aaaactcagc 
atactgaatc 
tattfcagtca 
agaafcgaaag 
tgtcatcaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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agtacattgt tcaccgtgat cttaaggctg 
ttaaaattgc tgactttggt tttagtaafcg 
tttgtggaag cccaccctat gctgctcccg 
ctgaagtgga tgtgtggagt ctgggcgtca 
ctttcgatgg ccagaattta aaggaactgc 
ttcccttcta tatgtccaca gactgtgaaa 
caataaagag aggcagcttg gaacaaataa 
aagaggaaga actaaagcca tatactgagc 
tagacattat ggtcaccatg ggctttgcac 
agaagfcatga tgaagttafcg gctacttata 
aaggtggtga atcgttatcc agtggaaact 
taaacaacag cactcttcag tcccctgctc 
atcagaagca gcggcgtttc agtgafccatg 
ataccaaaag acctcaggct aacagtgtgg 
atgfcggctcg aaaacttggc agcacaacag 
ctcttgtagg gccagagagg aaaaaatctt 
gaggtagcat ggcaagaagg aatacatatg 
cattgcagaa tggaaaagac agcagcctta 
caggctcttc tgtggcctct gctgtcccct 
ccacttctgg tcatcctatt aaagtcacac 
accggcctgg tacaacccag agagtgcctg 
ctgcgactcc agaccggacc cgttttcccc 
gfcgaacagct ccgggagcga cgcagcgttg 
atgaaacggg tgcatttgca catgccagaa 
tcacatccaa atttgtfccgc agggatccaa 
cctcaagaag tacatcaggg gaaccaaaag 
agccgcgttc tttgcggttc acatggagta 
acatgatgag agaaatccga aaagtgttag 
agagattttt gctfcttctgt gtccatggag 
agatggaagt cfcgcaagfcfcg ccacgactgt 
ctgggacatc tattgccttt aagaacattg 
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aaaaccttct 


ccttgatggt 


gatatgaata 


600 


aafctfcacagt 


tgggaacaaa 


ttggacacat 


660 


agcttttcca 


aggaaagaag 


tatgafcgggc 


- 720 


ttctctatac 


attagfccagt 


Q-gctccttgc 


780 


gagagcgagt 


tttacgaggg 


aagtaccgta 


840 


afccttctgaa gaaattatta g-tcctgaatc 


900 


tgaaagatcg afcggatgaat 


©ttggtcatg 


960 


ctgafcccgga 


tttcaatgac 


acaaaaagaa 


1020 


gaga t gaaat 


aaatgatgcc 


ttaataaatc 


1080 


ttcttctagg tagaaaacca 


cctgaatttg 


1140 


tgtgtcagag 


gtcccggccc 


agtagfcgact 


1200 


acctgaaggt 


ccagagaagt 


atctcagcaa 


1260 


cfcggtccatc 


cattcctcct 


gctgfcatcat 


1320 


aaagtgaaca 


gaaagaggag 


tgggacaaag 


1380 


tfcggafccaaa 


aagcgagatg 


actgcaagcc 


1440 


caacfcattcc 


aagtaacaat 


gtgtattctg 


1500 


tctgfcgaaag 


gaccacagat 


cgatacgtag 


1560 


cg-gagatgtc 


tgtgagtagc 


atatcttctg 


1620 


cagcacgacc 


ccgccaccag 


aagtccatgt 


1680 


tgccaaccat 


taaagacggc 


tctgaagctt 


1740 


ctgcttcccc 


atctgctcac 


agtattagta 


1800 


gagggagctc 


aagccgaagc 


actttccatg 


1860 


cttataatgg gccacctgct 


tcaccatccc 


1920 


g-g-ggaacgtc 


aactggtata 


ataagcaaaa 


1980 


gtgaaggcga 


agccagfcggc 


agaaccgaca 


2040 


aaagagacaa 


ggaagagggt 


aaagattcta 


2100 


tgaagaccac 


tagfctcaatg 


gaccctaatg 


2160 


atgcaaataa 


ctgtgattat 


gagcaaaaag 


2220 


acgctagaca 


ggatagcctc 


gtgcagfcggg 


2280 


cacttaatgg ggttcgcfctc 


aagcgaatat 


2340 


catcaaaaat 


agcaaatgag 


cttaagctgt 


2400 
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aaagaagtcc aaatttacag gttcagggaa gatacataca tatatgaggt acagtttttg 2460 
aatytactgg ba atgcctaa tgtggtctgc ctgtgaatct cccca 2505 



<210> 11 
<211> 4638 
<212> BHA 



<213> Homo 


sapiens 
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ccaccgcctc 


ccggctcccc 


ttccacgcct 


300 




ercefcccrcccrc ccccrccacrccr 


ccgggcaacc 


gcctcgcccg 


aagccctccc 


360 
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gcggctcccc 


ctcctctt.ee 


420 


tccyuy luu l» 


ct-t-ccctctt tccccccrcccr 


gggccgcttg 


ttgcaccgcc 


ccgcggcctg 


480 


uyyycty>-»-y'- 


tcgccccggc cttgtgctcg 


cgtccgcacc 


cctttcctgt 


cgccccccgg 


540 


y y uwuyv* c*. w w 


acagcccggc cggcgagacc 


ccggccagac 


cccgcfcgccc 


gcacaaaatg 


600 


— w y y C=L 


cgccattgcc gaeggtgaac 


gagegggaca 


eggaaaatea tacatctgtg 


660 


f-f-i 4" rrrr 3 fc a fc a 


ctgaaccaca catccagcct 


ac c aagfc c ga 


gtagcagaca 


gaacatcccc 


720 


v y y Ly t— ciy C2.cs. 


actccattac gtcagcaaca 


gatgaacagc 


ctcacattgg aaattaccgt 


780 


l_. u> cX v— e3.cxctca.c3 ca. 


caatagggaa gggaaatttt 


gecaaagtea 


aattggcaag acaegttcta 


840 


cxC tyy ucty cxy 


aggttgctgt gaaaataata 


gacaaaactc 


agctaaatcc 


taccagtcta 


900 


r»« a a sort" tat 


ttcgagaagt acgaataatg 


aagatactga 


atcatcctaa 


tataggtgaa 


960 


rr1- a t~ ttcratt 


acttagttgc ccatggaaga 


atgaaagaga 


aagaggcccg 


tgcaaaattt 


1020 


^5 rrrreacrafc tcr 

Cty y *-ay c*. <— uy 


tatctgetgt acagtattgt 


catcaaaagt 


acafctgttca ccgrtgafcctfc 


1080 


aaggctgaaa 


accttctcct tgatggtgat 


atgaatatta 


aaattgctga ctttggtttt 


1140 


agtaatgaat 


ttacagttgg gaacaaattg gacacatttt 


gtggaagccc 


accctatgct 


1200 


gctcccgagc 


ttttccaagg aaagaagtat 


gatgggcctg 


aagtggatgt 


gtggagtctg 


1260 


ggcgtcattc 


tctatacatt agtcagtggc 


tccttgcctfc 


tegatggeca 


gaatttaaag 


1320 


gaactgcgag 


agcgagtttt acgagggaag 


tacegtatte 


ccttctatat 


gtccacagac 


1380 


tgtgaaaatc 


ttctgaagaa attattagtc 


ctgaatccaa 


taaagagagg 


cagcttggaa 


1440 


caaataatga 


aagatcgatg gatgaatgtt 


ggtcatgaag 


aggaagaact 


aaagecatat 


1500 


actgagcctg 


ateeggattt caatgacaca 


aaaagaatag 


acattatggt 


caccatgggc 


1560 
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tttgcacgag atgaaataaa tgatgcctta ataaatcaga agtatgatga agttatggct 1620 

acttatattc ttctaggtag aaaaccacct gaatttgaag gtggtgaatc g-ttafcccagt 1680 

ggaaacttgt gtcagaggtc ccggcccagt agtgacttaa acaacagcac tcttcagtcc 1740 

cctgcfccacc tgaaggtcca gagaagtatc tcagcaaatc agaagcagcg g-cgtttcagt 1800 

gatcatgctg gtccatccat tcctcctgct gtatcatata ccaaaagacc fccaggctaac 1860 

agtgtggaaa gtgaacagaa agaggagtgg gacaaagatg tggctcgaaa acttggcagc 1920 

acaacagttg gatcaaaaag cgagatgact gcaagccctc ttgtagggcc agagaggaaa 1980 

aaatcttcaa ctattccaag taacaatgtg tafctctggag gtagcatggc aagaaggaat 2040 

acatatgtct gtgaaaggac cacagatcga tacgtagcafc tgcagaafcgg aaaagacagc 2100 

agccttacgg agatgtctgt gagtagcata tcttctgcag gctcttctgt ggcctctgct 2160 

gtcccctcag cacgaccccg ccaccagaag tccatgtcca ctifcctggtca tcctattaaa 2220 

gtcacactgc caaccattaa agacggctct gaagcttacc ggcctggtac aacccagaga 2280 

gtgcctgctg cttccccatc tgctcacagt afctagtactg cgactccaga ccggacccgt 2340 

tttccccgag ggagctcaag ccgaagcact tfcccafcggtg aacagctccg ggagcgacgc 2400 

agcgttgctt ataafcgggcc acctgcttca ccatcccatg aaacgggtgc atttgcacat 2460 

gccagaaggg gaacgtcaac tggtataata agcaaaatca catccaaatt tgttcgcaga 2520 

agtacatcag gggaaccaaa agaaagagac aaggaagagg gtaaagattc taagccgcgt 2580 

tctttgcggt tcacatggag tatgaagacc actagfctcaa tggaccctaa fcgacatgatg 2640 

agagaaatcc gaaaagtgtt agatgcaaat aactgtgatt atgagcaaaa agagagattt 2700 

ttgcfctttct gtgtccatgg agacgctaga caggatagcc tcgtgcagtg ggagatggaa 27 60 

gtctgcaagt tgccacgact gtcacttaat ggggttcgct tcaagcgaat afcctgggaca 2820 

tctattgcct ttaagaacat tgcatcaaaa atagcaaatg agcttatgct gtaaagaagt 2880 

ccaaatttac aggttcaggg aagatacata catatatgag gtacagtttt tgaafcgtacfc 2940 

ggtaatgcct aatgtggtct gcctgtgaat ctccccatgt agaatttgcc cttaatgcaa 3000 

taaggttata catagttatg aactgtaaaa ttaaagtcag tatgaactat aataaatatc 3060 

tgtagcttaa aaagtaggtt cacatgtaca ggtaagtata ttgtgtattt ctgttcattt 3120 

tctgttcata gagttgtata ataaaacatg attgcttaaa aacttgtata gfctgtcfcaga 3180 

tttctgcacc tgaatgtatg tttgatgcfct fcgatttgaaa atgttctfccc ctgttattta 3240 

cattctggtg ggfcttttaaa attcttacct ccatcatgca attttgaaaa ttgtgtccag 3300 

aattaaaagt gcatagaaat agcctttaca attgtagcat ggacctttaa aaattgtttt 3360 

aaaatcttat ttaaatttaa accagaagct gaaaaataga tcagctttat tatacacaaa 342 0 
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attattactg 


cfctatctttg 


ctcttttcct 


tgttatcccg 


caaggtttag 


ttgagaagat 


3480 


acaaaafcgtt 


tacagtgttg 


gcacttagag 


tttttaaatt 


caagtacatg 


aaattcagta 


3540 


atagcattgc 


ctfcgagctaa 


ctaggaagta 


ccgggaaaaa 


agfctaaatct 


acatcaagtt 


3600 


tcttttgaac 


tttsgaagtgt 


tttctgaccc 


actgctaact 


gtagcaacaa 


aafcttaaaag 


3660 


aaaaaaaaca 


tactttatct: 


ggctatfcafca 


acataaactg 


tcacgtaggt 


fctgctgcctt 


3720 


cagaataccg 


caatttaatt 


gcgggaatat 


aataatattg 


ggactgtfctc 


acagcacaaa 


3780 


ctcatcttta cagtgttgat 


caatgcatca 


gttaagaaat 


aatgccacct 


caggaattaa 


3840 


ctggcattgg gaacatttgc 


ctcattctcc 


fcgctatcctc 


ttcattcacc 


ccfcgccactg 


3900 


taatatctat 


aagtacttaa 


gagacttgfcg 


agcaaaacat 


actatttata acagtatatg 


3960 


attgatttat 


gcttatgtgg 


ttgttcagtt 


tgtfccccatg 


taactcgttt 


gttittaaata 


4020 


ttttgccaga 


tttcttgtat 


ttattccaca 


tcattatgcc 


fcafcaatgfcgc 


cgcfctfcgfcga 


4080 


ttgggcattt 


gcctactttt 


ctttcataat 


bagfcgafcata 


tgcgatgtaa 


aaccactagt 


4140 


aaaggtacat 


tttaatactt 


gttattttat 


actgaattag 


cctfcggaggt 


tgactgfcgca 


4200 


atgttattta 


ctgttgtaat 


tactgtaata 


ccaacatatg 


ggccccatct 


gcacactcct 


42 60 


gaaaaacaga 


aagtgtatfcc 


aaattttafcc 


agtttaaaga 


aaataaagct 


gtgataaata 


4320 


ctgtaattcc 


aacctacatt 


agaaggfccta 


agtgtaggtg 


atgfcgccatt 


ccataatggc 


4380 


ttccagacta 


gggtgaattt 


tatgttctgt 


actgtactgt 


gafcgtagctt 


tcttctgtaa 


4440 


cagtfcatgtt 


ttaaaattaa 


gtgagtttfct 


tfctttgcctt 


agcaaagggt 


ggtgtttgaa 


4500 


aaaaaaaatg 


tgtagcccct 


ttttaaccta 


gtgttcattc 


aaaaaaaaafc 


tgatgcaaat 


4560 


ctttattcac 


tttcactggt 


gcacactgaa 


attttacttg 


aacagttctc 


ataataaagc 


4620 


acttgtcttt 


tgctcttt 










4638 



<210> 12 

<211> 2720 

<212> DNA 

<213> Homo sapiens 

<400> 12 

tcatggaata cgcgagtggg ggtgaagfcat ttgattactt agttgcccat ggaagaatga * 60 

aagagaaaga ggcccgtgca aaattfcaggc agattgtatc tgctgtacag tattgtcatc 120 

aaaagtacat tgttcaccgt gatcttaagg ctgaaaacct tcfcccttgat ggtgatatga 180 

atattaaaat tgctgacttt ggtttfcagta afcgaatttac agttgggaac aaattggaca 240 

cattttgtgg aagcccaccc tatgctgctc ccgagctttt ccaaggaaag aagtafcgatg 300 

gtcctgaagt ggatgtgtgg agfcctgggcg tcattctcta tacattagtc agtggctcct 360 
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tgcctttcga tggccagaat ttaaaggaac tgcgagagcg agttttacga gg-gaagtacc 420 

gtattccctt cfcatatgfccc acagactgtg aaaatcttct gaagaaatta ttagtcctga 480 

atccaataaa gagaggcagc ttggaacaaa taatgaaaga tcgatggatg aeitgttggtc 540 

atgaagagga agaactaaag ccatatactg agcc'tgatcc ggatttcaat gacacaaaaa 600 

gaatagacat tatggtcacc atgggctttg cacgagatga aataaatgat gccttaataa 660 

atcagaagta tgatgaagtt atggctactt atattcttct aggtagaaaa ccacctgaat 720 

ttgaaggtgg tgaatcgtta tccagtggaa acttgtgtca gaggtcccgg cocagtagtg 780 

acttaaacaa cagcactctt cagtcccctg ctcacctgaa ggtccagaga agrtatctcag 840 

caaatcagaa gcagcggcgt ttcagtgatc atgctggtcc atccattcct ccstgctgtat 900 

catataccaa aagacctcag gctaacagtg tggaaagtga acagaaagag geigtgggaca 960 

aagatgtggc tcgaaaactt ggcagcacaa cagttggatc aaaaagcgag atgactgcaa 1020 

gccctcttgt agggccagag aggaaaaaat ctbcaactat tccaagtaac aetfcgtgtatt 1080 

ctggaggtag catggcaaga aggaatacat atgtctgtga aaggaccaca gettcgatacg 1140 

tagcattgca gaatggaaaa aacagcagcc ttacggagat gtctgtgagt agcatatctt 1200 

ctgcaggctc ttctgtggcc tctgctgccc cctcagcacg accccgccac cagaagtcca 1260 

tgtccacttc tggtcatcct attaaagtca cactgccaac cattaaagac ggctctgaag 1320 

cttaccggcc tggtacaacc cagagagtgc ctg-ctgcttc cccatctgct cacagtatta 1380 

gtactgcgac tccagaccgg acccgttttc cccgagggag ctcaagccga agcactttcc 1440 

atggtgaaca gctccgggag cgacgcagcg ttg-cttataa tgggccacct gcttcaccat 1500 

cccatgaaac gggtgcattt gcacatgcca gaaggggaac gtcaactggt ataataagca 1560 

aaatcacatc caaatttgtt cgcagaagta catcagggga accaaaagaa agagacaagg 1620 

aag-agggtaa agattctaag ccgcgttctt tgcggttcac atggagtatg aagaccacta 1680 

gttcaatgga ccctaatgac atgatgagag aaatccgaaa agtgttagat gcaaataact 1740 

gtgattatga gcaaaaagag agatttttgc ttttctgtgt ccatggagac gctagacagg 1800 

atagcctcgt gcagtgggag atggaagtct gcaagttgca cgactgtcac ttaatggggt 1860 

tcgcttcaag cgaatatctg ggacatctat tgcctttaag aacattgcat caaaaatagc 1920 

aaafcgagctt aagctgtaaa gaagtccaaa tttacaggtfc cagggaagat acatacatat 1980 

atgaggtaca gtttttgaat gtactggtaa tgcctaatgt ggtctgcctg tgaatctccc 2040 

catgtagaat ttgcccttaa tgcaataagg ttatacatag ttatgaactg taaaattaaa 2100 

gtcagtatga actataataa atatctgtag cttaaaaagt aggttcacat gtacaggtaa 2160 

gfcafcatfcgtg tatttctgtt cattttctgt tcatagagtfc gtataataaa acatgattgc 2220 
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ttaaaaactt gtafcagfctgt ctagatttct 
tgaaaatgfct cttccctgtt atttacattc 
atgcaatttt gaaaattgtg tccagaatta 
agcatggaoc tttaaaaatt gttttaaaat 
atagatcagc tttattatac acaaaattat 
tcccgcaagg tttagttgag aagatacaaa 
aaattcaagt acatgaaatt cagtaatagc 
aaaaaagtta aatctacatc aagtttcttt 
taactgtagc aacaaaattt 

<210> 13 

<211> 2698 

<212> DNA 

<213> Homo sapiens 

<400> 13 

gagctgaaat tcgcggtgcg acgggaggga 
gcggggcgcc ctgaggcaag gggacgccgg 
ctcggctccg ccactgccgc cctcccggtc 
aatgagcccc gggacccgcc gggggacggc 
tagggggaag ggagccgccc tccccacggc 
cgctggtcgc cggcctccta gggcfcgtgct 
ttaaagtgca gtaaaatgtc cactaggacc 
gaaaaccaca cgtcacatgg agatgggcgt 
ggagctcggt gtagaaactc tatagcctcc 
tacagactgt tgaaaacaat cggcaagggg 
atcctfcacag gcagagaggt tgcaataaaa 
agtctacaaa agctcttcag agaagtaaga 
gtgaagttat tcgaagtcat tgaaactgaa 
agtggaggtg aagtatttga ctatttggtt 
agatctaaat ttagacagat tgtgtctgca 
catcgagacc tcaaggctga aaatctattg 
gatttcggtt ttagcaatga atttactgtt 
cctccatacg cagcacctga gctcttccag 
gtgtggagtc tgggggtcat tttatacaca 
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gcacctgaat gtafcgtttga tgctfctgatt 2280 
cggtgggttt ttaaaattct tacctccatc 2340 
aaagtgcata gaaatagcct fcfcacaattgt 2400 
cttatttaaa. tttaaaccag aagctgaaaa 2460 
tactgcttat cttfcgctctt ttccttgfcta 2520 
atgtttacag tgttggcact fcagagfctttt 2580 
attgccfcfcga gctaactagg aagtaccggg 2640 
tgaactttga agtgfcttfcct gacccactgc 2700 

2720 



gtggagaagg 


aggtgagggg 


gcccaggatc 


60 


tgggfccgaag 


cgcagcccgc 


cgcccgcagg 


120 


tcctcgcctc 


gggcgccgag 


gcagggagag 


180 


ccgggccagg 


cccgggatct 


agaacggccg 


240 


gccttttcgg 


aactgccgtg 


gactcgagga 


300 


gttttgtttt 


gaccctcgca 


ttgtgcagaa 


360 


ccattgccaa 


cggfcgaafcga 


acgagacact 


420 


caagaagtta 


cctctcgtac 


cagccgctca 


480 


tgtgcagatg 


aacaacctca 


catcggaaac 


540 


aattttgcaa 


aagtaaaatt 


ggcaagaaat 


600 


ataattgaca 


aaactcagtt 


gaatccaaca 


660 


ataatgaaga 


ttttaaatca 


tcccaatata 


720 


aaaacactct 


acctaateat 


ggaatatgca 


780 


gcacatggca 


ggatgaagga 


aaaagaagca 


840 


gttcaatact 


gccatcagaa aeggatcgta 


900 


ttagatgccg 


atatgaacat 


taaaatagca 


960 


ggcggtaaac 


tcgacacgtt 


tfcgtggcagt 


1020 


ggcaagaaat 


atgacgggcc 


agaagtggat 


1080 


ctagtcagtg 


gctcacttcc 


ctttgatggg 


1140 
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caaaacctaa aggaactgag agagagagta ttaagaggga aatacagaat tcccttctac 1200 

atgtctacag actgtgaaaa ccttctcaaa cgfcttcctgg tgctaaatcc aattaaacgc 1260 

ggcactctag agcaaatcat gaaggacagg tggatcaatg cagggcatga agaagatgaa 1320 

ctcaaaccat ttgttgaacc agagctagac atctcagacc aaaaaagaat agatattatg 13 80 

gtgggaatgg gatattcaca agaagaaatt caagaatctc tfcagtaagat gaaatacgat 1440 

gaaatcacag ctacatattt gttattgggg agaaaatctt cagagctgga fcgctagtgat 1500 

tccagttcta gcagcaatct ttcacttgct aaggttzaggc cgagcagtga tctcaacaac 1560 

agtactggcc agtctcctca ccacaaagtg cagagaagfcg tttcttcaag ccaaaagcaa 1620 

agacgctaca gtgaccatgc tggaccagct attccfcfccfcg ttgtggcgta tccgaaaagg 1680 

agtcagacaa gcactgcaga tggtgacctc aaagaagatg gaatttcctc ccggaaatca 1740 

agtggcagtg ctgttggagg aaagggaatt gctccagcca gtcccatgct fcgggaatgca 1800 

agtaatccta ataaggcgga tattcctgaa cgcaagaaaa gctccactgt ccctagtagt 1860 

aacacagcat ctggtggaat gacacgacga aatacttatg tttgcagtga gagaactaca 1920 

gctgatagac actcagtgat tcagaatggc aaagaaaaca gcactattcc tgatcagaga 1980 

actccagttg cttcaacaca cagtatcagt agtgcagcca ccccagatcg aatccgcttc 2040 

ccaagaggca ctgccagtcg tagcactttc cacggccagc cccgggaacg gcgaaccgca 2100 

acatataatg gccctcctgc ctctcccagc ctgtcccatg aagccacacc attgtcccag 2160 

actcgaagcc gaggctccac taatctcttt agtaaattaa cttcaaaact cacaaggagt 2220 

cgcaatgtat ctgctgagca aaaagatgaa aacaaagaag caaagcct eg atccctacgc 2280 

ttcacctgga gcatgaaaac cactagttca atggatcccg gggacatgat gegggaaate 2340 

cgcaaagfcgt tggacgccaa taactgegae tatgagcaga gggagegett cttgctcttc 2400 

tgcgtccacg gagatgggca cgeggagaac ctcgtgcagt gggaaatgga agtgtgcaag 2460 

ctgccaagac tgtctctgaa eggggtcegg tttaagcgga tat eggggae atccatagcc 2520 

ttcaaaaata ttgcttccaa aattgccaat gagctaaagc tgtaacccag tgattatgat 2580 

gtaaattaag tagcaagtaa agtgttttcc tgaacactga tggaaatgta tagaataata 2640 

tttaggcaat aaegtctgea tcttctaaat catgaaatta aagtcfcgagg acgagagc 2698 

<210> 14 

<211> 2914 

<212> IMA 

<213> Homo sapiens 

<400> 14 

gacggcccgg gccaggcccg ggatctagaa eggcegtagg gggaagggag ccgccctccc 60 
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cacggcgcct fcttcggaact gccgtggact; cgaggacgct ggtcgccggc ctcctagggc 120 

tg ty u L ybLL fcg fc-bttgacc ctcgcattgt gcagaattaa agtgcagtaa aatgtccact 180 

aggaccccat tgccaacggt gaatgaacga gacactgaaa accacacgtc acatggagat 240 

gggcgtcaag aagttacctc tcgtaccagc cgctcaggag ctcggtgtag aaactctata 300 

gcctcctgtg cagatgaaca acctcacatc ggaaactaca gacfcgttgaa aacaatcggc 360 

aaggggaatt ttgcaaaagt aaaattggca agacatatcc ttacaggcag agaggttgca 420 

ataaaaataa ttgacaaaac tcagttgaat ccaacaagtc tacaaaagct cttcagagaa 480 

gtaagaataa tgaagatttt aaatcatccc aatatagtga agttattcga agtcattgaa 540 

actcaaaaaa cactctacct aatcatggaa tacgcaagtg gaggtaaagt atttgactat 600 

ttggttgcac atggcaggat gaaggaaaaa gaagcaagat ctaaatttag acagatfcgtg 660 

tctgcagttc aatactgcca tcagaaacgg atcgtacatc gagacctcaa' ggctgaaaat 720 

ctattgttag atgccgatat gaacattaaa atagcagatt tcggttttag caatgaattt 780 

actgttggcg gtaaactcga cacgttttgt ggcagtcctc catacgcagc acctgagctc 840 

ttccagggca agaaatatga cgggccagaa gtggatgtgt ggagtctggg ggtcatttta 900 

tacacactag tcagtggctc acttcccttt gatgggcaaa acctaaagga actgagagag 960 

agagtattaa gagggaaata cagaattccc ttctacatgt ctacagactg tgaaaacctt 1020 

ctcaaacgtt tcctggtgct aaatccaatt aaacgcggca ctctagagca aatcatgaag 1080 

gacaggtgga tcaatgcagg gcatgaagaa gatgaactca aaccatttgt tgaaccagag 1140 

ctagacatct cagaccaaaa aagaatagat attatggtgg gaatgggata ttcacaagaa 1200 

gaaattcaag aatctcttag taagatgaaa tacgatgaaa tcacagctac atatttgtta 1260 

ttggggagaa aatcttcaga ggttaggccg agcagtgatc tcaacaacag tactggccag 132 0 
tctcctcacc acaaagtgca gagaagtgtt tcfctcaagcc aaaagcaaag acgctacagt • 1380 

gaccatgctg gaccaggtat tccttctgtt gtggcgtatc cgaaaaggag tcagaccagc 1440 

actgcagata gtgacctcaa agaagatgga atttcctccc ggaaatcaac tggcagtgct 1500 

gttggaggaa agggaattgc tccagccagt cccatgcttg ggaatgcaag taatcctaat 1560 

aaggcggata ttcctgaacg caagaaaagc tccactgtcc ctagtagtaa cacagcatct 162 0 

ggtggaatga cacgacgaaa tacfctatgtt tgcagtgaga gaactacaga tgatagacac 1680 

tcagtgattc agaatggcaa agaaaacagc actattcctg atcagagaac tccagttgct 1740 

tcaacacaca gtatcagtag tgcagccacc ccagatcgaa tccgcttccc aagaggcact 1800 

gccagfccgta gcactttcca cggccagccc cgggaacggc gaaccgcaac atataatggc 1860 

cctcctgcct ctcccagcct gtcccatgaa gccacaccat tgtcccagac tcgaagccga 1920 
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ggctccacta 


cfcctctttag taaattaact 


tcaaaactca 


caaggagtcg 


caatgtatct 


1980 


gctaagcaaa 


aagatgaaaa caaagaagca 


aagcctcgat 


ccctacgctt 


ciaccfcggagc 


2040 


atgaaaacca 


ctagttcaat ggatcccggg 


gacatgatgc 


gggaaatccg 


oaaagtgttg 


2100 


gacgccaata 


actgcgacta tgagcagagg gagcgcttct 


tgctcttctg 


cgtccacgga 


2160 


gatgggcacg 


cggagaacct cgtgcagtgg gaaatggaag tgtgcaagct 


gccaagactg 


2220 


tctctgaacg 


gggtccggtt taagcggata tcggggacat 


ccatagcctt 


caaaaatatt 


2280 


gcttccaaaa 


fctgccaatga gctaaagctg taacccagtg attatgatgt 


aaattaagta 


2340 


gcaagtaaag 


fcgttttcctg aacactgatg gaaatgtata gaataatatt 


taggcaataa 


2400 


cgtctgcatc 


ttctaaatca tgaaattaaa gfcctgaggac 


gagagcacgc 


crtgggagcga 


2460 


aagctggccfc 


•ttttfcctacg aatgcactac 


attaaagatg 


tgcaacctat 


gcgccccctg 


2520 


ccctacttcc 


gttaccctga gagtcggcgt 


gfcggccccat: 


ctccatgtgc 


ctcccgtctg 


2580 


ggtgggtgfcg 


agagtggacg gtatgtgtgt 


gaagtggtgt 


atatggaagc 


atctccctac 


2640 


actggcagcc 


agtcattact agtacctctg 


cgggagatca 


tccggfcgcta 


aaacattaca 


2700 


gttgccaagg 


aggaaaatac tgaatgactg 


ctaagaatta 


accttaagac 


cagttcatag 


2760 


ttaatacagg 


tttacagttc atgcctgtgg 


tfcttgfcgttt 


gttgttttgt 


gtfcfcttttag 


2820 


tgcaaaaggt 


fctaaafcttat agttgtgaac 


attgcttgtg 


tgfcgfcfcfcfcfcc 


taagtagatt 


2880 


cacaagataa 


ttaaaaattc actttttctc 


aggt 






2914 



<210> 15 

<211> 3895 

<212> DNA 

<213> Homo sapiens 

<400> 15 

ctgcaggaat tccgatcctt ccgcaggttc acctacggaa accttgfctac gacttttact 60 

tcctctagat agtcaagttc gaccgtcttc tcagcgctcc gccagggccg tgggccgacc 120 

ccggcggggc cgatccgagg gcctcactaa accatccaat cggtagtagc gacgggcggt 180 

gtgtacaaag ggcagggact taatcaacgc aagcfctatga cccgcactta ctgggaattc 240 

ctcgttcatg gggaataatt gcaatccccg atccccatca cgaatggggt tcaacgggtt 300 

acccgcgcct gccggcgtag ggtaggcaca cgctgagcca gtcagtgtag cgcgcgtgca 360 

gccccggaca tctaagggca tcacagacct gttattgctc aatctcgggt ggctgaacgc 420 

cacttgtccc tctaagaagt tgggggacgc cgaccgctcg ggggtcgcgt aactagttag 480 

catgccagag tctcgttcgt tatcggaatt aaccagacaa atcgctccac caactaagaa 540 

cggccatgca ccaccaccca cggaatcgag aaagagctat caatctgtca atcctgtccg 600 

tgtccgggcc gggtgaggtt tcccgtgttg ag-tcaaatta agccgcaggc tccactcctg 660 
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gtggfcgccct tccgtcaatt ccttfcaagtt tcagctttgc aaccatactc cccccggaac 720 

ccaaagactt tggtttcccg gaagctgccc ggcgggtcat gggaataacg ccgccgcatc 780 

gccggtcggc atcgtttatg gtcggaacta cgacggtatc tgatcgtctt cgaacctccg 840 

acfcfcfccgttc ttgattaatg aaaacattct tggcaaatgc tttcgctctg gtccgtcttg 900 

cgccggtcca agaatttcgg aattccgcag cggcggccag cagggcggag gctgaggcag 960 

caagctcgct agagagggag aagcagtcgg gcgcaggcgc ctcctccgca gcccgctcca 1020 

tggfccggcgc ccacagcccg cggcggcctg tcttgcgctc cacttccttc acatcctcct 1080 

ccgcctcctc gttttcaggc gccgccggcg gcgctgtgtg gaggcccgcg agctgaaatt 1140 

cgcggtgcga cgggagggag tggagaagga ggtgaggggg cccaggatcg cggggcgccc 12 00 

tgaggcaagg ggacgccggc gggccgaagc gcagcccgcc gcccgcaggc tcggctccgc 1260 

cactgccgcc ctcccggtct cctcgcctcg gccgccgagg cagggagaga atgagccccg 1320 

ggacccgccg ggggacggcc cgggccaggc ccgggatcta gacggccgta gggggaaggg 1380 

agccgccctc cccacggcgc cttttcggaa ctgccgtgga ctcgaggacg ctggtcgccg 1440 

gcctcctagg gctgtgctgt tttgttttga ccctcgcatt gtgcagaatt aaagtgcagt 1500 

aaaatgtcca ctaggacccc attgccaacg gtgaatgaac gagacactga aaaccacacg 1560 

tcacatggag atgggcgtca agaagttacc tctcgtacca gccgctcagg agctcggfcgt 1620 

agaaactcta fcagcctcctg tgcagatgaa caacctcaca tcggaaacta cagactgttg 1680 

aaaacaatcg gcaaggggaa ttttgcaaaa gfcaaaattgg caagacatat ccttacaggc 1740 

agagaggttg caataaaaat aattgacaaa actcagttga atccaacaag tctacaaaag 1800 

ctcfctcagag aagtaagaat aatgaagatt ttaaatcatc ccaatatagt gaagttattc 1860 

gaagtcattg aaactgaaaa aacactctac ctaatcatgg aatatgcaag tggaggtgaa 1920 

gtatttgact atttggttgc acatggcaag atgaaggaaa aagaagcaag atctaaattt 1980 

agacagggtt gtcaagctgg acagactatt aaagttcaag tctcctttga ttfcgcttagt 2 040 

ctgatgttta catttattgt gtctgcagtt caatactgcc atcagaaacg gatcgtacat 2100 

cgagacctca aggctgaaaa tctattgtta gatgccgata tgaacattaa aatagcagat 2160 

ttcggtttta gcaatgaatt tacfcgttggc ggtaaactcg acacgttttg tggcagtcct 2220 

ccatacgcag cacctgagct cttccagggc aagaaatatg acgggccaga agtggatgtg 2280 

tggagtctgg gggtcatttt atacacacta gtcagtggct cacttccctt tgatgggcaa 2340 

aacctaaagg aactgagaga gagagtatta agagggaaat acagaattcc cttctacatg 2400 

tctacagact gtgaaaacct tctcaaacgt ttcctggtgc taaatccaat taaacgcggc 2460 

actctagagc aaatcatgaa ggacaggtgg atcaatgcag ggcatgaaga agatgaactc 2520 
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aaaccatttg ttgaaccaga gctagacatc tcagaccaaa aaagaataga tattatggtg 2580 

ggaatgggat attcacaaga agaaattcaa gaatctctta gtaagatgaa atacgatgaa 2 640 

atcacagcta catatttgtt attggggaga aaatcttcag agctggatgc ta.gtgattcc 2700 

agttctagca gcaatctttc acttgctaag gtfcaggccga gcagtgatct caacaacagt 2760 

actggccagt ctcctcacca caaagtgcag agaagfcgrbfcfc cttcaagcca aaagcaaaga 2820 

cgctacagtg accatgctgg accagctatt ccttcfcgttg tggcgfcatcc gaaaaggagt 2 880 

cagacaagca ctgcagatgg tgacctcaaa gaagat.ggaa tttcctcccg gaaatcaagt 2940 

ggcagtgctg ttggaggaaa gggaattgct ccagccagtc ccatgcttgg gaatgcaagt 3 000 

aatcctaata aggcggatat tcctgaacgc aagaaaagct ccactgtccc tagtagtaac 3060 

acagcatctg gtggaatgac acgacgaaat acttafcgtfct gcagtgagag aactacagct 3120 

gatagacact cagtgattca gaatggcaaa gaaaacagca ctattcctga tcagagaact 3180 

ccagttgctt caacacacag tatcagtagt gcagccaccc cagatcgaat ccgcttccca 3240 

agaggcactg ccagtcgtag cactttccac ggccagcccc gggaacggcg aaccgcaaca 3300 

tataatggcc ctcctgcctc tcccagcctg tcccatgaag ccacaccatt gtcccagact 3360 

cgaagccgag gctccactaa tctctttagt aaattaactt caaaactcac aaggagtcgc . 3420 

aatgtatctg ctgagcaaaa agatgaaaac aaagaagcaa agcctcgatc cctacgcttc 3480 

acctggagca tgaaaaccac tagttcaafcg gafccccgggg acatgatgcg ggaaatccgc 3540 

aaagtgttgg acgccaataa ctgcgactat gagcagaggg agcgcttctt gctcttctgc 3 600 

gtccacggag atgggcacgc ggagaacctc gfcgcagtggg aaatggaagt gtgcaagctg 3660 

ccaagactgt ctctgaacgg ggtccggfcfct aagcggatat cggggacatc catagccttc 3720 

aaaaatattg cttccaaaat tgccaatgag cfcaaagctgt aacccagtga ttatgatgta 3780 

aattaagtag caagtaaagt gttttccfcga acactgatgg aaatgtatag aataatattt 3840 

aggcaataac gtctgcatct tctaaatcat gaaattaaag tctgaggacg agagc 3895 

<210> 16 

<211> 2145 

<212> DMA 

<213> Homo sapiens 

<400> 16 

atgtccacta ggaccccatt gccaacggfcg aatgaacgag acactgaaaa ccacacgtca 60 

catggagatg ggcgtcaaga agttacctct cgtaccagcc gcfccaggagc tcggtgtaga 120 

aactctatag cctcctgtgc agatgaacaa cctcacatcg gaaactacag actgttgaaa 180 

acaatcggca aggggaattt tgcaaaagta aaattggcaa gacatatcct tacaggcaga 240 
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gaggttgcaa taaaaataat tgacaaaact cagttgaatc caacaagtct acaaaagctc 3 00 

ttcagagaag taagaataat gaagatttta aatcatccca atatagtgaa gttattcgaa 360 

gtcattgaaa ctgaaaaaac actctaccta atcatggaat atgcaagtgg aggtgaagta 420 

tttgactatt tggttgcaca tggcaggatg aaggaaaaag aagcaagatc taaafcttaga 480 

cagattgtgt ctgcagttca atactgccat cagaaacgga tcgtacatcg agacctcaag 540 

gctgaaaatc tattgttaga tgccgatatg aacattaaaa tagcagafctt cggttttagc 600 

aatgaattta ctgttggcgg taaactcgac acgttttgtg gcagtcctcc atacgcagca 660 

cctgagctct tccagggcaa gaaatatgac gggccagaag tggatgtgtg gagtctgggg 720 

gtcattttat acacactagt cagtggctca cttccctttg atgggcaaaa cctaaaggaa 780 

ctgagagaga gagfcattaag agggaaatac agaattccct tctacatgtc tacagactgt 840 

gaaaaccttc tcaaacgttt ccfcggtigcta aatccaatta aacgcggcac tctagagcaa 900 

atcatgaagg acaggtggat caatgcaggg catgaagaag atgaactcaa accatttgtt 960 

gaaccagagc tagacatctc agaccaaaaa agaatagata ttatggtggg aatgggatat 1020 

tcacaagaag aaattcaaga atctcttagt aagatgaaat acgatgaaat cacagctaca 1080 

tafcttgttat tggggagaaa atcttcagag gtfcaggccga gcagtgatct caacaacagt 1140 

actggccagfc ctcctcacca caaagtgcag agaagtgttt cttcaagcca aaagcaaaga 1200 

cgctacagtg accatgctgg accagctatt ccttctgttg tggcgtatcc gaaaaggagt 12 60 

cagaccagca ctgcagatag tgacctcaaa gaagatggaa tttccfccccg gaaafccaagt 1320 

ggcagtgctg ttggaggaaa gggaattgct ccagccagtc ccatgcttgg gaafcgcaagt 1380 

aatcctaata aggcggatat tcctgaacgc aagaaaagct ccactgtccc tagtagtaac 1440 

acagcatcfcg gtggaatgac acgacgaaat acttatgttt gcagfcgagag aactacagct 1500 

gatagacact cagtgattca gaatggcaaa gaaaacagca ctattcctga tcagagaact 1560 

ccagttgcfct caacacacag tatcagtagt gcagccaccc cagatcgaat ccgcttccca 1620 

agaggcactg ccagtcgtag cactttccac ggccagcccc gggaacggcg aaccgcaaca 1680 

tataatggcc ctcctgcctc tcccagcctg tcccatgaag ccacaccatt gtcccagact 1740 

cgaagccgag gctccactaa tctctttagt aaattaactt caaaactcac aaggagtcgc 1800 

aatgtatctg ctgagcaaaa agatgaaaac aaagaagcaa agcctcgatc cctacgcttc 1860 

acctggagca tgaaaaccac tagttcaafcg gatcccgggg acatgatgcg ggaaatccgc 1920 

aaagtgttgg acgccaataa ctgcgactat gagcagaggg agcgcttctt gctcttctgc 1980 

gtccacggag atgggcacgc ggagaacctc gtgcagtggg aaatggaagt gtgcaagctg 2040 

ccaagactgt ctctgaacgg ggtccggttt aagcggatat cggggacatc catagccttc 2100 
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aaaaatattg cttccaaaat tgccaatgag cfcaaagctgt aaccc 2145 

<210> 17 

<211> 2193 

<212> DNA 

<213> Homo sapiens 

<400> 17 

atgtccacta ggaccccatt gccaacggtg aatgaacgag acactgaaaa ccacacgtca 60 

catggagatg ggcgtcaaga agttacctct cgtaccagcc gctcaggagc tcggtgtaga 120 

aactctatag cctcctgtgc agatgaacaa cctcacatcg gaaactacag <actgttgaaa 180 

acaatcggca aggggaattt tgcaaaagta aaattggcaa gacatatcct "bacaggcaga 240 

gaggttgcaa taaaaataat tgacaaaact cagttgaatc caacaagtct acaaaagctc 300 

ttcagagaag taagaataat gaagatttta aatcatccca atatagtgaa gbtattcgaa 360 

gtcattgaaa ctgaaaaaac actctaccta afccatggaat atgcaagtgg <aggtgaagta 420 

tttgactatt tggttgcaca tggcaggatg aaggaaaaag aagcaagatc "taaatttaga 480 

cagattgtgt ctgcagttca atactgccat cagaaacgga tcgtacatcg agacctcaag 540 

gctgaaaatc tafctgttaga tgccgatatg aacattaaaa tagcagattt oggttttagc 600 

aafcgaattta ctgttggcgg taaactcgac acgtfcttgtg gcagtcctcc atacgcagca 660 

cc tgagctct tccagggcaa gaaatatgac gggccagaag tggatgtgtg gagtctgggg 720 

gtcattttat acacactagt cagtggctca cfctccctttg atgggcaaaa cctaaaggaa 780 

ctgagagaga gagtattaag agggaaatac agaattccct tctacatgtc tacagactgt 840 

gaaaaccttc tcaaacgttt cctggtgcta aatccaatta aacgcggcac tctagagcaa 900 

atcatgaagg acaggfcggat caatgcaggg catgaagaag atgaactcaa accatttgtt 960 

gaaccagagc tagacatctc agaccaaaaa agaatagata ttatggtggg aatgggatat 1020 

tcacaagaag aaattcaaga atctctfcagt aagatgaaat acgatgaaat cacagctaca 1080 

tatttgttat tggggagaaa atcttcagag ctggatgcta gtgattccag ttctagcagc 1140 

aatctttcac ttgctaaggt taggccgagc agtgatctca acaacagtac tggccagtct 1200 

cctcaccaca aagtgcagag aagtgtttct tcaagccaaa agcaaagacg ctacagtgac 1260 

catgctggac cagctattcc ttctgttgtg gcgtatccga aaaggagtca gaccagcact 1320 

gcagatagtg acctcaaaga agatggaatt tcctcccgga aatcaagtgg cagtgctgtt 1380 

ggaggaaagg gaattgctcc agccagtccc atgcttggga atgcaagtaa tcctaataag 1440 

gcggatattc ctgaacgcaa gaaaagcfccc actgtcccta gtagtaacac agcatctggt 1500'' 

ggaatgacac gacgaaatac ttatgtttgc agtgagagaa ctacagctga tagacactca 1560 

gtgattcaga atggcaaaga aaacagcact afctcctgafcc agagaactcc agttgcttca 1620 
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acacacagta 
agtcgtagca 
cctgcctctc 
tccactaatc 
gagcaaaaag 
aaaaccacta 
gccaataact 
gggcacgcgg 
ctgaacgggg 
tccaaaattg 



tcagtagtgc 
ctttccacgg 
ccagcctgtc 
tctttagtaa 
atgaaaacaa 
gttcaatgga 
gcgactatga 
agaacctcgt 
fcccggtttaa 
ccaatgagct 



agccacccca 
ccagccccgg 
ccatgaagcc 
attaacttca 
agaagcaaag 
tcccggggac 
gcagagggag 
gcagtgggaa 
gcggatatcg 
aaagctgtaa 



gatcgaatcc 
gaacggcgaa 
acaccattgt 
aaactcacaa 
cctcgatccc 
atgatgcggg 
cgcttcttgc 
atggaagfcgt 
gggacatcca 
ccc 



gcttcccaag 
ccgcaacata 
cccagactcg 
ggagtcgcaa 
tacgcttcac 
aaatccgcaa 
tctfcctgcgt 
gcaagctgcc 
tagccttcaa 



aggcactgcc 
taatggccct 
aagccgaggc 
tgfcafcctgct 
ctggagcatg 
agtgttggac 
ccacggagat 
aagactgtct 
aaatattgct 



<210> 18 

<211> 3373 

<212> DHA 

<213> Homo sapiens 

<400> X8 

caggcgcctc ctccgcagcc cgctccatgg tcggcgccca cagcccgcgg cggcctgtct 
tgcgctccac ttccttcaca tcctcctccg cctcctcgtt ttcaggcgcc gccggcggcg 
ctgtgtggag gcccgcgagc tgaaattcgc ggtgcgacgg gagggagtgg agaaggaggt 
gagggggccc aggatcgcgg ggcgccctga ggcaagggga cgccggcggg ccgaagcgca 
gcccgccgcc cgcaggctcg gctccgccac tgccgccctc ccggtctcct cgcctcggcc 
gccgaggcag ggagagaatg agccccggga cccgccgggg acggcccggg ccaggcccgg 
gatctagaac ggccgtaggg ggaagggagc cgccctcccc acggcgcctt ttcggaactg 
ccgtggactc gaggacgctg gtcgccggcc tccfcagggct gtgctgtttt gttttgaccc 
tcgcattgtg cagaattaaa gtgcagtaaa atgtccacta ggaccccatt gccaacggtg 
aatgaacgag acactgaaaa ccacacgtca catggagatg ggcgtcaaga agfctacctct 
cgtaccagcc gctcaggagc tcggtgtaga aactctatag cctcctgtgc agatgaacaa 
cctcacatcg gaaactacag actgttgaaa acaatcggca aggggaattt tgcaaaagta 
aaattggcaa gacatatcct tacaggcaga gaggttgcaa taaaaataat tgacaaaact 
cagttgaatc caacaagtct acaaaagctc ttcagagaag taagaataat gaagatttta 
aatcatccca atatagtgaa gttattcgaa gtcattgaaa ctgaaaaaac actctaccta 
atcatggaat atgcaagtgg aggtaaagta tttgactatt tggttgcaca tggcaggatg 
aaggaaaaag aagcaagatc taaatttaga cagattgtgt ctgcagttca atactgccat 



1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2193 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
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cagaaacgga tcgtacatcg agacctcaag gcfcgaaaatc tattgtfcaga tgrccgatatg 1080 

aacattaaaa tagcagattt cggttttagc aafcgaattta ctgttggcgg taaactcgac 1140 

acgttttgtg gcagtcctcc atacgcagca cctgagctct tccagggcaa gaaatatgac 1200 

gggccagaag tggatgtgtg gagtctgggg gtcattttat .acacactagt cagtggctca 1260 

cttccctttg atgggcaaaa cctaaaggaa ctgagagaga gagtattaag agggaaatac 1320 

agaattccct tctacatgtc tacagactgt gaaaaccttc tcaaacgttfc cctggtgcta 1380 

aatccaatta aacgcggcac tctagagcaa atcatgaagg acaggtggat caatgcaggg 1440 

catgaagaag atgaactcaa accatttgtt gaaccagagc tagacatctc agaccaaaaa 1500 

agaatagata ttatggtggg aatgggatat tcacaagaag aaattcaaga atctcttagt 1560 

aagatgaaat acgatgaaat cacagctaca tafcttgttat tggggagaaa atcttcagag 1620 

ctggatgcta gtgattccag ttctagcagc aatctttcac ttgctaaggt: taggccgagc 1680 

agtgatctca acaacagtac tggccagtct cctcaccaca aagtgcagag aagi:gtfctct 1740 

tcaagccaaa agcaaagacg ctacagtgac cafcgctggac cagctattcc fctctgttgtg 1800 

gcgtatccga aaaggagtca gaccagcact gcagatagtg acctcaaaga agatggaatt 1860 

tcctcccgga aatcaagtgg cagtgctgtt ggaggaaagg gaattgctcc agccagtccc 1920 

atgcttggga atgcaagtaa tcctaataag gcggatattc ctgaacgcaa gaaaagctcc 1980 

actgtcccta gtagtaacac agcatctggt ggaatgacac gacgaaatac ttatgtttgc 2040 

agtgagagaa ctacagctga tagacactca gtgattcaga atggcaaaga aaacagcact 2100 

attcctgatc agagaactcc agttgcttca acacacagta tcagtagtgc agccacccca 2160 

gatcgaatcc gcttcccaag aggcactgcc agfccgtagca ctttccacgg ccagccccgg 2220 

gaacggcgaa ccgcaacata taatggccct cctgcctctc ccagcctgtc ccatgaagcc 2280 

acaccattgt cccagactcg aagccgaggc tccactaatc tctttagtaa attaacttca 2340 

aaactcacaa ggagaaacat gtcattcagg tttatcaaaa ggcttccaac tgaatatgag 2400 

. aggaacggga gatatgaggg ctcaagtcgc aatgtatctg ctgagcaaaa agatgaaaac 2460 

aaagaagcaa agcctcgatc cctacgcttc acctggagca tgaaaaccac tagttcaatg 2520 

gatcccgggg acatgatgcg ggaaatccgc aaagtgttgg acgccaataa ctgcgactat 2580 

gagcagaggg agcgcttctfc gctcttctgc gtccacggag atgggcacgc ggagaacctc 2640 

gtgcagtggg aaatggaagt gtgcaagctg ccaagactgt ctctgaacgg ggtccggttt 2700 

aagcggatat cggggacatc catagccttc aaaaatattg cttccaaaat tgccaatgag 2760 

ctaaagctgt aacccagtga ttatgatgta aattaagtag caagtaaagt gttttcctga 2 820 

acactgatgg aaatgtatag aataatattt aggcaataac gtctgcatcfc tctaaatcat 2880 
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gaaattaaag tcfcgaggacg agagcacgcc tgggagcgaa agctggcctfc fcfcttctacga 2940 

atgcactaca tfcaaagatgt gcaacctatg cgccccctgc cctacttccg ttaccctgag 300 0 

agtcggcgtg tggccccatc tccatgtgcc tcccgtctgg gtgggtgtga gagtggacgg 3060 

tatgtgtgtg aagtggtgta tatggaagca tctccctaca ctggcagcca gtcattacta 3120 

gtacctctgc gggagatcat ccggtgctaa aacattacag ttgccaagga ggaaaatact 3180 

"gaatgactgc taagaattaa ccttaagacc agttcatagt taafcacaggt ttacagttca 3240 

tgccfcgtggt tttgtgtttg ttgttttgtg ttfcttttagt gcaaaaggtfc taaafcfctata 3300 

gttgtgaaca ttgcttgtgt gtgtttttct aagtagattc acaagataat taaaaattca 3360 

ctttttctca ggt 3373 

<210> 19 

<211> 3609 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<222> (3606) . . (3606) 

<223> "ii" is A, C, G, or T 



60 
120 



gcctgcccgc cgcccccatg gcgcccgggg tccccgctgc acggggccac taggaccctc 

ggcgtccctt cccctccccc gccctgcccc ctctcccgcc gcgcggaccc gggcgttctc 180 

ggcgcccagc fctttgagctc gcgtccccag gccggcgggg ggggagggga agagagggga 240 

ccctgggacc cccgcccccc ccacccggcc gcccctgccc cccgggaccc ggagaagatg 300 

tcttcgcgga cggtgctggc cccgggcaac gatcggaact cggacacgca tggcaccttg 360 

ggcagtggcc gctcctcgga caaaggcccg tcctggtcca gccgctcact gggtgcccgt 420 

tgccggaact ccatcgcctc ctgtcccgag gagcagcccc acgtgggcaa ctaccgcctg 480 

ctgaggacca ttgggaaggg caactctgcc aaagtcaagc tggctcggca catcctcact 540 

ggtcgggagg ttgccatcaa gattatcgac aaaacccagc tgaatcccag cagcctgcag 600 

aagctgttcc gagaagtccg catcatgaag ggcctaaacc accccaacat cgtgaagctc 660 

tttgaggtga fctgagactga gaagacgctg tacctggtga tggagtacgc aagtgctgga 720 

gaagtgtttg actacctcgt gtcgcafcggc cgcatgaagg agaaggaagc tcgagccaag 780 

ttccgacaga ttgtttcggc tgtgcactat tgtcaccaga aaaatattgt acacagggac 840 

ctgaaggctg agaacctctt gctggafcgcc gaggccaaca fccaagattgc tgactttggc 900 

ttcagcaacg agttcacgct gggafccgaag ctggacacgfc tctgcgggag ccccccatat 960 
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gccgccccgg agctgtttca 


gggcaagaag 


tacgacgggc 


cggaggtgga 


cafccfcggagc 


1 090 


ctgggagtca 


tcctgtacac 


cctcgtcagc 


ggctccctgc 


ccttcgacgg 


gcacaacctc 


1080 


aaggagctgc 


gggagcgagt 


actcagaggg 


aagfcaccggg 


fcccctttcta 


catgtcaaca 




gactgtgaga gcatcctgcg 


gagatttttg gtgctgaacc 


cagctaaacg cbgrtactctc 


1200 


gagcaaatca 


tgaaagacaa 


atggafccaac 


atcg-gctatg 


agggtgagga gttgaagcca 


1260 


tacacagagc 


ccgaggagga 


cttcggggac 


accaagagaa 


ttgaggtgat 


99"tgggtatg 


1320 


ggctacacac 


gggaagaaat 


caaagagtcc 


ttgaccagcc 


agaagtacaa 


cg-aagfcgacc 


1380 


gccacctacc 


fccctgctggg 


caggaagact 


gaggagggtg 


gggaccgggg 


cg-ccccaggg 


1440 


ctggccctgg 


cacgggtgcg 


ggcgcccagc 


gacaccacca 


acggaacaag 


ttccagcaaa 


1500 


ggcaccagcc 


acagcaaagg 


gcagcggagt tcctcttcca 


cctaccaccg 


ccagcgcagg 


1560 


catagcgatt 


tctgtggccc 


atcccctgca 


cccctgca.cc 


ccaaacgcag 


cccgacgagc 


1620 


acgggggagg 


cggagctgaa 


ggaggagcgg 


ctgccaggcc 


ggaaggcgag ctgcagcacc 


1680 


gcggggagtg ggagtcgagg gctgcccccc tccagcccca tggfccagcag cgrcccacaac 


1740 


cccaacaagg 


cagagatccc 


agagcggcgg 


aaggacagca 


cgagcacccc 


caacaacctc 


1800 


cctcctagca 


tgatgacccg 




fcacgfcfcfcgca 


cagaacgccc 


acxcrcra'c t CTacr 


1860 


cgcccgtcac 


tgttgccaaa 


tgggaaagaa 


aacagctcag 


gcaccccacg 


ggtgccccct: 


1920 


gcctccccct 


ccagtcacag 


cctggcaccc 


ccatcagggg 


agcggagccg 


cctggcacgc 


1980 


ggttccacca 


tccgcagcac 


cttccatggt 


ggccaggtcc 


gggaccggcg 


ggcagggggt 


2040 


gggggtggtg ggggtgtgca 


gaatgggccc 


cctgcctctc 


ccacactggc 


ccatgaggct 


2100 


gcacccctgc 


ccgccgggcg 


gccccgcccc 


accaccaacc 


tcttcaccaa 


gctgacctcc 


2160 


aaactgaccc 


gaagggttac 


cctcgatccc 


tctaaacggc 


agaactctaa 


ccgctgtgtt 


2220 


tcgggcgcct 


ctctgcccca 


gggatccaag 


atcaggtcgc 


agacgaacct 


gagagaatcg 


2280 


ggggacctga 


ggtcacaagt 


tgccatctac 


ctfcgggatca 


aacggaaacc 


gccccccggc 


2340 


tgcfcccgatt 


cccctggagt 


gtgaagctga 


ccagctcgcg 


cccfccctgag 


gccctgatgg 


2400 


cagcfcctgcg 


ccaggccaca 


gcagccgccc 


gcfcgccgctg 


ccgccagcca 


cagccgttcc 


2460 


tgctggcctg 


cctgcacggg 


ggtgcgggcg 


ggcccgagcc 


cctgtcccac 


ttcgaagtgg 


2520 


aggtctgcca 


gctgccccgg 


ccaggcttgc 


gggrgagttct 


cttccgccgt 


gtggcgggca 


2580 


ccgccctggc 


cttccgcacc 


ctcgtcaccc 


gcatcfcccaa 


cgacctcgag 


ctctgagcca 


2640 


ccacggtccc 


agggccctta 


ctcttcctct 


cccttgtcgc 


cttcacttct 


acaggagggg 


2700 


aaggggccag 


ggaggggatt 


ctcccfcttat 


catcacctca 


gtttccctga 


attatatttg 


2760 


ggggcaaaga 


ttgtcccctc 


tgctgttctc 


tggggccgct 


cagcacagaa 


gaaggatgag 


2820 
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ggggctcagc 

gccctaccaa 

tgaggaaatc 

agctgctgag 

attccttccc 

gggcactttg 

aggagctgcc 

gccccctcac 

tgaccctagg 

aatccaggca 

gtgggggtgg 

ccgaggcccc 

gfctacactgg 

attganccc 



ggggggagct 
gagggcacct 

tfcccattcct 
cctaaagact 
ctcccgtccc 
tacaaatcct 
gctgtcacct 
ctgcagcctg 
attttccttc 
gtggtttttc 
gaaggtaagg 
agttcctata 
actctaagcc 



ggcaccttcc 

gaggagactt 

cccaacagct 

ggagaatctg 

ctcacgctca 

tgtaaatacc 

acggttttta 

ttgcccaata 

cctgccctca 

ctttcggagc 

atggtcgtgg 

tcgggccccc 

acttcttact 



tggagcctcc 
tggggacagg 
caaaattagg 

ggggactggg 
aacccccact 
ccacaccctc 
agttattaca 
aatttaagag 
cctgcaaatg 
ctcggttttc 
aagaaggcag 
cattcatcca 
ccagtagtaa 



agccagtcct 
gcaggggcag 
ccttgggcag 
agtgggggtc 
tcctgcccca 
ccctctgcaa 
ccccgaccct 
agtccccccc 
agttaaagaa 
tcatctgcag 
gatggaactc 
ctcacactcc 
atttattgca 



gtcctccctc 
ggagggaaac 
gggcagggag 
agagaggcag 
ggctggcgcg 
aggtctcttg 
cctcctgtca 
tccccaatgc 
gaggcgtggg 
aatgggagcg 
ggcctcatcc 
cagccaccat 
ataaacaatc 



<210> 20 

<211> 2085 

<212> DHA 

<213> Homo sapiens 

<400> 20 

agatgtcttc gcggacggtg ctggccccgg gcaacgatcg gaactcggac acgcatggca 
ccttgggcag tggccgctcc tcggacaaag gcccgtcctg gtccagccgc tcactgggtg 
cccgttgccg gaactccatc gcctcctgtc ccgaggagca gccccacgtg ggcaactacc 
gcctgctgag gaccattggg aagggcaact ttgccaaagt caagctggct cggcacatcc 
tcactggtcg ggaggttgcc atcaagatta tcgacaaaac ccagctgaat cccagcagcc 
tgcagaagct gttccgagaa gtccgcatca tgaagggcct aaaccacccc aacatcgtga 
agctctttga ggtgattgag actgagaaga cgctgtacct ggtgatggag tacgcaagtg 
ctggagaagt gtttgactac ctcgtgtcgc atggccgcat gaaggagaag gaagctcgag 
ccaagttccg acagattgtt tcggctgtgc actattgtca ccagaaaaat attgtacaca 
gggacctgaa ggctgagaac ctcttgctgg atgccgaggc caacatcaag attgctgact 
ttggcttcag caacgagttc acgctgggat cgaagctgga cacgttctgc gggagccccc 
catatgccgc cccggagctg tttcagggca agaagtacga cgggccggag gtggacatct 
ggagcctggg agtcatcctg tacaccctcg tcagcggctc cctgcccttc gacgggcaca 



2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
. 3600 
3609 



60 
12 0 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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acctcaagga gctgcgggag cgagtactca gagggaagta ccgggtccct tfcctacatgt 


840 


caacagactg tgagagcatc 


ctgcggagat ttttggtgct 


gaacccagct 


aaacgctgta 


900 


ctctcgagca aatcatgaaa 


gacaaatgga tcaacatcgg 


ctatgagggt 


gag-gagttga 


960 


agccatacac agagcccgag 


gaggacttcg gggacaccaa 


gagaattgag 


gtg-atggtgg 


1020 


gtatgggcta cacacgggaa gaaatcaaag agtccttgac 


cagccagaag 


tacaacgaag 


1080 


tgaccgccac ctacctcctg 


ctgggcagga agactgagga 


gggtggggac 


cg-g-ggcgccc 


1140 


cagggctggc cctggcacgg gtgcgggcgc ccagrcgacac 


caccaacgga 


acaagttcca 


1200 


gcaaaggcac cagccacagc 


aaagggcagc ggag-ttcctc 


ttccacctac 


caccgccagc 


1260 


gcaggcatag cgatttctgt 


ggcccatccc ctgcacccct 


gcaccccaaa 


cgcagcccga 


1320 


cgagcacggg ggaggcggag 


ctgaaggagg agcg-gctgcc 


aggccggaag 


gcgagctgca 


1380 


gcaccgcggg gagtgggagt 


cgagggctgc ccccctccag 


ccccatggtc 


agcagcgccc 


1440 


acaaccccaa caaggcagag 


atcccagagc ggcg-gaagga 


cagcacgagc 


acccccaaca 


1500 


acctccctcc tagcatgatg 


acccgcagaa acacctacgt 


ttgcacagaa 


cg-cccggggg 


1560 


ctgagcgccc gtcactgttg 


ccaaatggga aagaaaacag 


ctcaggcacc 


ccacgggtgc 


1620 


cccctgcctc cccctccagt 


cacagcctgg cacccccatc 


aggggagcgg 


ag-ccgcctgg 


1680 


cacgcggttc caccatccgc 


agcaccttcc atggtggcca 


ggtccgggac 


corgcgggcag 


1740 


ggggtggggg tggtgggggt gtgcagaatg ggccccctgc 


ctctcccaca 


ctggcccatg 


1800 


aggctgcacc cctgcccgcc 


gggcggcccc gccccaccac 


caacctcttc 


accaagctga 


1860 


cctccaaact gacccgaagg 


gttaccctcg atccctctaa 


acggcagaac 


tctaaccgct 


1920 


gtgtttcggg cgcctctctg ccccagggat ccaagatcag 


gtcgcagacg 


aacctgagag 


1980 


aatcggggga cctgaggtca 


caagttgcca tcfcaccttgg 


gatcaaacgg 


aaaccgcccc 


2040 


ccggctgctc cgattcccct 


ggagtgtgaa gctgaccagc 


tcgcg 




2085 



<210> 21 

<211> 2278 

<212> DNA 

<213> Homo sapiens 

<400> 21 

agatgtcttc gcggacggtg ctggccccgg gcaacgatcg gaactcggac acgcatggca 60 

ccttgggcag tggccgctcc tcggacaaag gcccgtcctg gtccagccgc tcactgggtg 120 

cccgttgccg gaactccatc gcctcctgtc ccgaggagca gccccacgtg ggcaactacc 180 

gcctgctgag gaccattggg aagggcaact ttgccaaagt caagctggct cggcacatcc 240 

tcactggtcg ggaggttgcc atcaagatta tcgacaaaac ccagctgaat cccagcagcc 300 

tgcagaagct gttccgagaa gtccgcatca tgaagggccfc aaaccacccc aacatcgtga 360 
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agctcttfcga ggtgattgag actgagaaga cgcfcgtacct ggtgatggag tacgcaagtg 420 

ctggagaagt gtttgactac ctcgtgtcgc atggccgcat gaaggagaag gaagctcgag 480 

ccaagttccg acagattgtt tcggctgtgc actattgtca ccagaaaaat attgtacaca 540 

gggacctgaa ggctgagaac ctcttgctgg atgccgaggc caacatcaag attgctgact 600 

ttggcttcag caacgagttc acgctgggat cgaagctgga cacgttctgc gggagccccc 660 

catatgccgc cccggagctg tttcagggca agaagtacga cgggccggag gtggacatct 720 

ggagcctggg agtcatcctg tacaccctcg tcagcggctc cctgcccttc gacgggcaca 780 

acctcaagga gctgcgggag cgagtactca gagggaagta ccgggtccct ttctacatgt 840 

caacagactg tgagagcatc ctgcggagat ttttggtgct gaacccagct aaacgctgta 900 

ctctcgagca aatcatgaaa gacaaatgga tcaacat egg ctatgagggt gaggagttga 960 

agccatacac agageccgag gaggacttcg gggacaccaa gagaattgag gtgafcggtgg 1020 

gtatgggcta cacaegggaa gaaatcaaag agtccttgac cagecagaag tacaacgaag 1080 

tgaccgccac ctacctcctg ctgggcagga agactgagga gggtggggac cggggcgccc 1140 

cagggctggc cctggcacgg gtgegggege ccagcgacac caccaacgga acaagttcca 1200 

gcaaaggcac cagccacagc aaagggcagc ggagttcctc ttccacctac caccgccagc 1260 

gcaggcatag cgatttctgt ggcccatccc ctgcacccct gcaccccaaa cgcagcccga 1320 

egagcaeggg ggaggeggag ctgaaggagg agcggctgcc aggceggaag gegagctgea 13 80 

gcaccgcggg gagtgggagt egagggctge ccccctccag ccccatggtc agcagcgccc 1440 

acaaccccaa caaggcagag atcccagagc ggcggaagga cagcacgagc acccccaaca 1500 

acctccctcc tagcatgatg acccgcagaa acacctacgt ttgcacagaa cgcccggggg 1560 

ctgagcgccc gtcactgttg ccaaatggga aagaaaacag ctcaggcacc ccacgggtgc 1620 

cccctgcctc cccctccagt cacagcctgg cacccccatc aggggagegg agccgcctgg 1680 

cacgcggttc caccatccgc agcaccttcc atggtggcca ggtcegggae eggegggcag 1740 

ggggtggggg tggtgggggt gtgcagaatg ggccccctgc ctctcccaca cfcggcccatg 1800 

aggctgcacc cctgcccgcc gggcggcccc gccccaccac caacctcttc accaagctga 1860 

cctccaaact gacccgaagg gtcgcagacg aacctgagag aatcggggga ccfcgaggtca 1920 

caagttgeca fcctaccttgg gatcaaaegg aaaccgcccc ccggctgctc cgattcccct 1980 

ggagtgtgaa gctgaccagc tcgcgccctc ctgaggccct gatggcagct ctgcgccagg 2040 

ccacagcagc cgcccgctgc cgctgccgcc agccacagcc gttcctgctg gcctgcctgc 210 0 

acgggggtgc gggcgggccc gagcccctgt cccacttcga agtggaggtc tgccagctgc 2160 

cccggccagg ctfcgcgggga gttctcttcc gccgtgtggc gggcaccgcc ctggccttcc 222 0 
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gcaccctcgt cacccgcatc tccaacgacc tcgagctctg agccaccacg g-fccccagg 227 8 

<210> 22 

<211> 4917 

<212> DNA 

<213> Homo sapiens 

<400> 22 

agaagatgtc ttcgcggacg gtgctggccc cgggcaacga tcggaactcg gacacgcatg 60 

gcaccttggg cagtggccgc tccfccggaca aaggcccgtc ctggfcccagc cgctcactgg 12 0 

gtgcccgttg ccggaactcc atcgcctcct gtcccgagga gcagccccac Q-fcgggcaact 180 

accgcctgct gaggaccatt gggaagggca actttgccaa agtcaagctg Q*ctcggcaca 240 

tcctcactgg tcgggaggtt gccatcaaga ttatcgacaa aacccagctg etatcccagca 300 

gcctgcagaa gctgttccga gaagtccgca tcatgaaggg cctaaaccac cccaacatcg 3 60 

tgaagctctt tgaggtgatt gagactgaga agacgctgta cctggtgatg gagtacgcaa 420 

gtgctggaga agtgttfcgac tacctcgtgt cgcatggccg catgaaggag eiaggaagcfcc 480 

gagccaagtt ccgacagatt gttfccggctg tgcacfcattg tcaccagaaa eiafcafctgtac 540 

acagggacct gaaggctgag aacctcttgc tggatgccga ggccaacatc aagattgctg 600 

actttggctt cagcaacgag ttcacgctgg gatcgaagct ggacacgttc tgcgggagcc 660 

ccccatatgc cgccccggag ctgfcttcagg gcaagaagta cgacgggccg gaggtggaca 720 

tctggagcct gggagtcatc ctgtacaccc tcgtcagcgg ctccctgccc ttcgacgggc 780 

acaacctcaa ggagctgcgg gagcgagtac tcagagggaa gtaccgggtc cctttctaca 840 

tgtcaacaga ctgtgagagc atcctgcgga gatttfctggt gctgaaccca gctaaacgct 900 

gtactctcga gcaaatcatg aaagacaaat ggatcaacat cggcfcatgag g-gfcgaggagt 960 

tgaagccata cacagagccc gaggaggact tcggggacac caagagaatt ^aggtgatgg 1020 

tgggtatggg ctacacacgg gaagaaatca aagagfccctt gaccagccag aagtacaacg 1080 

aagtgaccgc cacctacctc cfcgctgggca ggaagactga ggagggtggg gaccggggcg 1140 

ccccagggct ggccctggca cgggtgcggg cgcccagcga caccaccaac ggaacaagtt 1200 

ccagcaaagg caccagccac agcaaagggc agcggagttc ctcttccacc taccaccgcc 12 60 

agcgcaggca tagcgatttc tgtggcceat cccctgcacc cctgcacccc aaacgcagcc 1320 

cgacgagcac gggggaggcg gagctgaagg aggagcggct gccaggccgg aaggcgagct 1380 

gcagcaccgc ggggagtggg agtcgagggc tgcccccctc cagccccatg gtcagcagcg 1440 

cccacaaccc caacaaggca gagatcccag agcggcggaa ggacagcacg agcaccccca 1500 

acaacctccc tcctagcatg atgacccgca gaaacaccta cgtttgcaca gaacgcccgg 1560 

36 



0209BB90A2J_> 



WO 02/098890 



PCTYUS02/17461 



gggctgagcg cccgtcactg ttgccaaatg ggaaagaaaa cagctcaggc accccacggg 
tgccccctgc ctccccctcc agtcacagcc tggcaccccc atcaggggag cggagccgcc 
tggcacgcgg ttccaccatc cgcagcacct fcccatggtgg ccaggtccgg gaccggcggg 
cagggggtgg gggtggtggg ggtgtgcaga atgggccccc tgcctctccc acactggccc 
atgaggctgc acccctgccc gccgggcggc cccgccccac caccaacctc ttcaccaagc 
tgacctccaa actgacccga agggttaccc tcgatccctc taaacggcag aactctaacc 
gcfcgtgtttc gggcgcctct* ctgccccagg gatccaagat caggtcgcag acgaacctga 
gagaatcggg ggacctgagg tcacaagttg ccatctacct tgggatcaaa cggaaaccgc 
cccccggctg ctccgattcc cctggagtgt gaagctgacc agctcgcgce ctcctgaggc 
ccfcgafcggca gctctgcgcc aggccacagc agccgcccgc tgccgctgcc gccagccaca 
gccgttcctg ctggcctgcc tgcacggggg tgcgggcggg cccgagcccc tgtcccactt 
cgaagtggag gtctgccagc tgccccggcc aggcttgcgg ggagttctcfc tccgccgtgt 
ggcgggcacc gccctggcct tccgcaccct cgtcacccgc atctccaacg acctcgagct 
ctgagccacc acggtcccag ggcccttact cttcctcfccc cttgtcgcct tcacttctac 
aggaggggaa ggggccaggg aggggattct ccctttatca tcacctcagt ttccctgaat 
tatatttggg ggcaaagatt gtcccctctg ctgttctctg gggccgctca gcacagaaga 
aggatgaggg ggctcagcgg ggggagctgg cacctfccctg gagcctccag ccagtcctgt 
cctccctcgc cctaccaaga gggcacctga ggagactttg gggacagggc aggggcaggg 
agggaaactg aggaaatctt ccatfccctcc caacagcfcca aaattaggcc ttgggcaggg 
gcagggagag ctgctgagcc taaagactgg agaatctggg ggactgggag tgggggtcag 
agaggcagat tccttcccct cccgtcccct cacgctcaaa cccccacttc ctgccccagg 
ctggcgcggg gcactttgta caaatccttg taaatacccc acaccctccc ctctgcaaag 
gtctctfcgag gagctgccgc tgtcacctac ggtttttaag ttattacacc ccgaccctcc 
tcctgtcagc cccctcacct gcagcctgtt gcccaataaa tttaagagag tccccccctc 
cccaatgctg accctaggat tttccttccc tgccctcacc tgcaaatgag ttaaagaaga 
ggcgtgggaa tccaggcagt ggtttttcct ttcggagcct cggttttctc afcctgcagaa 
tgggagcggt gggggtggga aggtaaggat ggtcgtggaa gaaggcagga tggaactcgg 
cctcatcccc gaggccccag ttcctatatc gggcccccca ttcatccact cacactccca 
gccaccatgt tacactggac tctaagccac ttcttactcc agtagtaaat tfcattcaata 
aacaatcatt gacccatgcc tactccatgc caggcccagt gctggacaca gagacatgaa 
gctctgtctg tgggagacag ggattctgac acagacaccg gacaaaccat tgtcttgggg 



1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 



37 



WO 02/098890 PCT7US02/17461 



agcccagaag 


agaaagtggg 


cagggtgggg tcattgggga agatgctcta gagg-aattaa 


3480 


tgctggaatg 


gggtgttgaa 


ggafcgagtag gagttagtfca ggcattgagt ttgccctggg 


3540 


caaaagccca gaagtgggag 


tatgtggtat atctfccagag aactgggtaa ttfcaagtgtg 


3600 


gctgctgtgt 


tgggcatgga 


tggagaatca gcaagagaaa tgctgtatta ggactaataa 


3660 


fcccatctacg 


ctgcttaagc 


aaaaaggtat ttgttggttt atgttactta atagtccagg 


3720 


ggcacctggc 


ttcaggtagg 


tttgatccag gcatcaggcc attgcatcta ttttttcagt 


3780 


gtaagttgaa 


ttctagtaat 


ttttafccaag taagggctcc tttcctggtg gcacagatga 


3840 


cttcagcagt 


tagaagtttc 


tatccctcca gctttctgca gcagaaagac ccteattgtc 


3900 


agtttcccag 


caaaagtccc 


agggcagact ctcafctggcc caaatgggcc atgfcgatttt 


3960 


ctctaaacca 


atcactgtga 


ctctagagtg gccagactca gagctgcact tagfcaggggt 


4020 


tcctcaaagg aaggtcaagt 


gtcatgagca ggagaaaagg catgggagct ggacagatta 


4080 


tagtggttga 


agtctgtgca 


gtacagaagg gcggagctta ttcacacagc acctttgggg 


4140 


ccaaaatgaa 


taagcfcggac 


tttctcccca tggcactggg gaaccatgga agttcaggga 


4200 


acttcaggga 


agaggctfcgg 


tcaattcctg agagcatcct ctgtgctggg gaca.cagtgg 


4260 


taatcaagac 


agccccaaca 


ctgccctcat agagctcaca gtccaatgga ggag-gcagat 


4320 


gtgtcctcag 


gcagcgactg 


ggcagggctg gtatagggga gtccagaggt gatg-cctgcc 


4380 


fccagccaggg 


agggctfccct 


ggaggagaag gagccagcfca gacatggata ggag-fcgcgtt 


4440 


ttaggcacag 


caaatggcac 


atacaagggc cagggagcaa gagagaggac aggfccctcaa 


4500 


caaatggcat 


gtgacfcttgt 


aagtgtagaa ttgctgtgag gtatggggct aggggcgtca 


4560 


gtagggcctt 


gaaggttatg 


gacaggggcc tgggctttct tccaagggca ctgg-gggagc 


4620 


catggcaagg 


ttgtaggtag 


ggtagagatg ggcgggtttg tgctatgtgc agggtggaag 


4680 


ggagggaagt 


tgacaggtca 


gaagatcagg aaagaggtcg gggctggaca gatggggaga 


4740 


gcgcagatag 


atttaagaga 


gtcctgtgag gcaaagtggg caggacctgg taacaggtgt 


4800 


ctggactgtg gctttggctg 


gctcagaagg tccccactgg cgtgfcgtggt ctatgtagcc 


4860 


tctgggtgtg gagctgggat 


cttcaactgg ggacagtaca gtaaagaaca tcacagc 


4917 


<21D> 23 

<211> 3226 

<212> DNA 

<213> Homo sapiens 






<400> 23 
gacccggaga 


agatgtcttc 


gcggacggtg ctggccccgg gcaacgatcg gaactcggac 


60 


acgcatggca 


ccttgggcag 


tggccgctcc tcggacaaag gcccgtcctg gtccagccgc 


120 


tcacfcgggfcg 


cccgttgccg 


gaactccatc gcctcctgfcc ccgaggagca gccccacgtg 


180 
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ggcaactacc 


gcctgctgag 


gaccattggg 


aagggcaact 


ttgccaaagt 


caagctggct 


240 


cggcacatcc 


tcactggtcg 


ggaggttgcc 


atcaagatta 


tcgacaaaac 


ccagctgaat 


300 


cccagcagcc 


tgcagaagct 


gttccgagaa 


gtccgcatca 


tgaagggect 


aaaccacccc 


360 


aacatcgtga 


agctctttga 


ggtgattgag 


actgagaaga 


cgctgtacct 


ggtgatggag 


420 


tacgcaagtg 


ctggagaagt 


gtttgactac 


cfccgtgtcgc 


atggccgcat 


gaaggagaag 


480 


gaagctcgag 


ccaagttccg 


acagattgtt 


tcggctgtgc 


actattgtca 


ccagaaaaat 


540 


attgtacaca 


gggacctgaa 


ggc tgagaac 


etcttgetgg 


atgecgagge 


caacatcaag 


600 


attgctgact 


ttggcttcag 


caacgagttc 


aegctgggat 


cgaagctgga 


cacgttctgc 


660 


gggagccccc 


catatgccgc 


cccggagctg 


tttcagggca 


agaagtacga 


egggceggag 


720 


gfcggacatct 


ggagcctggg 


agtcatcctg 


tacaccctcg 


tcagcggctc 


cctgcccttc 


780 


gacgggcaca 


acctcaagga 


gctgcgggag 


cgagtactca gagggaagta ccgggfcccct 


840 


ttcfcacatgt 


caacagactg 


tgagagcatc 


ctgeggagat 


ttttggtgct 


gaacccagct 


900 


aaacgctgta 


ctctcgagca 


aatcatgaaa 


gacaaatgga 


teaacategg ctatgagggt 


960 


gaggagttga 


agccatacac 


agagcccgag 


gaggacttcg 


gggacaccaa 


gagaattgag 


1020 


gtgatggtgg 


gtatgggcta 


cacacgggaa 


gaaatcaaag 


agtccttgac 


cagecagaag 


1080 


tacaacgaag 


tgaccgccac 


etacctcctg 


ctgggcagga agactgagga gggtggggac 


1140 


— cggggcgccc 


cagggctggc 


cctggcacgg 


-gtgegggege 


ccagcgacac 


caccaacgga 


1200 


acaagttcca 


gcaaaggcac 


cagccacagc 


aaagggcagc 


ggagttcctc 


ttccacctac 


1260 


caccgccagc 


gcaggcatag cgatttctgt 


ggcccatccc 


ctgcacccct 


gcaccccaaa 


1320 


cgcagcccga 


cgagcacggg 


ggaggcggag 


ctgaaggagg agcggctgcc 


aggceggaag 


1380 


gcgagctgca 


gcaccgcggg 


gagtgggagt 


egagggctge 


ccccctccag 


ccccatggtc 


1440 


agcagcgccc 


acaaccccaa 


caaggcagag 


atcccagagc 


ggcggaagga 


cagcacgagc 


1500 


acccccaaca 


acctccctcc 


t age at gat g 


acccgcagaa 


acacctacgt 


ttgcacagaa 


1560 


cgcccggggg 


ctgagcgccc 


gtcactgttg 


ccaaatggga 


aagaaaacag 


ctcaggcacc 


1620 


ccacgggtgc 


cccctgcctc 


cccctccagt 


cacagcctgg 


cacGGGcatc 


aggggagegg 


1680 


agccgcctgg 


cacgcggttc 


caccatccgc 


agcaccttcc 


atggtggcca 


ggtcegggae 


1740 


cggcgggcag 


ggggtggggg tggtgggggt 


gtgcagaatg ggccccctgc 


ctctcccaca 


1800 


ctggcccatg 


aggctgcacc 


cctgcccgcc 


gggoggcccc 


gccccaccac 


caacctcttc 


186.0 


accaagctga 


cctccaaact 


gacccgaagg 


gtcgcagacg aacctgagag aatcggggga 


1920 


cctgaggtca 


caagttgcca 


tctaccttgg 


gatcaaaegg aaaccgcccc 


ccggctgctc 


1980 


cgattcccct 


ggagtgtgaa 


gctgaccagc 


tcgcgccctc 


ctgaggccct 


gatggcagct 


2040 
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ctgcgccagg ccacagcagc cgcccgctgc cgctgccgcc agccacagcc gtfcccfcg-ctg 2100 

gcctgcctgc acgggggtgc gggcgggccc gagcccctgt cccacttcga agfcggag-gtc 2160 

tgccagctgc cccggccagg cttgcgggga gttctcttcc gccgfcgtggc gggjcaccgcc 2220 

ctggccttcc gcaccctcgt cacccgcatc tccaacgacc tcgagctcfcg agccaccacg 2280 

gtcccaggcc cttatcttct ctcccttgtc gctfccacttc tacaggaggg gaagg-ggcca 2340 

gggaggggat fcctcccttta tcatcacctc agtfcfcccctg aattafcattt gggjggcaaag 2400 

attgtcccct ctgctgttct ctggggccgc tcagcacaga agaaggatga ggcjggctcag 2460 

cggggggagc tggcaccttc ctggagcctc cagccagtcc tgtcctccct cgccctacca 2520 

agagggcacc fcgaggagact ttggggacag ggcaggggca gggagggaaa ctgaggaaat 2580 

cttccattcc tcccaacagc tcaaaattag gcctfcgggca ggggcaggga gag-ctgctga 2 640 

gcctaaagac fcggagaatct gggggactgg gagtgggggt cagagaggca gattccttcc 2700 

cctcccgtcc cctcacgctc aaacccccac ttcctgcccc aggctggcgc gg-Q-gcacttt 2760 

gtacaaatcc ttgtaaatac cccacacctt cccttctgca aaggtctctt gacjgagcfcgc 2820 

cgctgtcacc tacggttttt aagtfcattac accccgaccc tcctcctgtc agccccctca 2880 

cgtgcagcct gttgcccaat aaattfcagga gagtcccccc ctccccaatg ctgaccctag 2940 

gatfcttcctt ccctgccctc acctgcaaat gagttaaaga agaggcgtgg gaatccaggc 3000 

agtggttttt cctttcggag cctcggtttt ctcatctgca gaafcgggagc ggtgggggtg 3060 

ggaaggtaag gatggtcgtc caagaaggca ggafcggaact cggcctcatc cccgaggccc 3120 

cagtfccctat atcgggcccc ccattcatcc actcacactc ccagccacca tgttacactg 3180 

gactttaagc catttcttac tccagtagta aatfctatfcca ataaac 3226 

<210> 24 

<211> 745 

<212> PET 

<213> Homo sapiens 

<400> 24 

Met He Arg Gly Arg Asn Ser Ala Thr Ser Ala Asp Glu Gin Pro His 



He Gly Asn Tyr Arg Leu Leu Lys Thr Xle Gly Lys Gly A'sxl Piie Ala 
20 25 30 

Lys Val Lys Leu Ala Arg His He Leu Thr Gly Lys Glu Val Ala Val 
35 40 45 

Lys He He Asp Lys Thr Gin Leu Asn Ser Ser Ser Leu Gin Lys Leu 
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50 55 60 

Phe Arg Glu Val Arg He Met Lys Val Lieu Asn His Pro Asn He Val 
65 70 75 80 

Lys Leu Plie Glu Val He Glu Thr Glu Lys Thr Leu Tyr Leu Val Met 
85 90 95 



Glu Tyr Ala Ser Gly Gly Glu Val Phe Asp Tyr Leu Val Ala His Gly 
100 ^ 105 110 



Arg Met Lys Glu Lys Glu Ala Arg Ala Lys Phe Arg Gin He Val Ser 
115 120 125 



Ala Val Gin Tyr Cys His Gin Lys Phe He Val His Arg Asp Leu Lys 
130 135 140 



Ala Glu Asn Leu Leu Leu Asp Ala Asp Met Asn He Lys He Ala Asp 

145 150 155 160 

Phe Gly Phe Ser Asn. Glu Phe Thr Phe Gly Asn Lys Leu Asp Thr Phe 

165 170 175 



Cys Gly Ser Pro Pro Tyr Ala Ala Pro Glu Leu Phe Gin Gly Lys Lys 
180 185 190 



Tyr Asp Gly Pro Glu Val Asp Val Trp Ser Leu Gly Val He Leu Tyr 
195 200 205 



Thr Leu Val Ser Gly Ser Leu Pro Phe Asp Gly Gin Asn Leu Lys Glu 
210 215 220 

Leu Arg Glu Arg Val Leu Arg Gly Lys Tyr Arg He Pro Phe Tyr Met 
225 230 235 240 



Ser Thr Asp Cys Glu Asn Leu Leu Lys Lys Phe Leu He Leu Asn Pro 
245 250 255 



Ser Lys Arg Gly Thr Leu Glu Gin He Met Lys Asp Arg Trp Met Asn 
260 265 270 



Val Gly His Glu Asp Asp Glu Leu Lys Pro Tyr Val Glu Pro Leu Pro 
275 280 285 



Asp Tyr Lys Asp Pro Arg Arg Thr Glu Leu Met Val Ser Met Gly Tyr 
290 295 300 
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Tlir Arg Glu Glu lie Gin Asp Ser Leu Val Gly Gin Arg Tyr Asn Glu 
305 310 315 320 



Val Met Ala Thr Tyr Leu Leu Leu Gly Tyr Lys Ser Ser Glu Leu Glu 
325 330 335 



Gly Asp Thr lie Thr Leu Lys Pro Arg Pro Ser Ala Asp Leu Thr Asn 
340 345 350 



Ser Ser Ala Gin Phe Pro Ser His Lys Val Gin Arg Ser Val Ser Ala 
355 360 365 



Asn Pro Lys Gin Arg Arg Phe Ser Asp Gin Ala Gly Pro Ala lie Pro 
370 ' 375 380 



Thr Ser Asn Ser Tyr Ser Lys Lys Thr Gin Ser Asn Asn Ala Glu Asn 
385 390 395 400 



Lys Arg Pro Glu Glu Asp Arg Glu Ser Gly Arg Lys Ala Ser Ser Thr 
405 410 415 



Ala Lys Val Pro Ala Ser Pro Leu Pro Gly Leu Glu Arg Lys Lys Thr 
420 425 430 



Thr Pro Thr Pro Ser Thr Asn Ser Val Leu Ser Thr Ser Thr Asn Arg 
435 440 445 



Ser Arg Asn Ser Pro Leu Leu Glu Arg Ala Ser Leu Gly Gin Ala Ser 
450 455 460 



He Gin Asn Gly Lys Asp Ser Leu Thr Met Pro Gly Ser Arg Ala Ser 
465 470 475 480 



Thr Ala Ser Ala Ser Ala Ala Val Ser Ala Ala Arg Pro Arg Gin His 
485 490 495 



Gin Lys Ser Met Ser Ala Ser Val His Pro Asn Lys Ala Ser Gly Leu 
500 505 510 



Pro Pro Thr Glu Ser Asn Cys Glu Val Pro Arg Pro Ser Thr Ala Pro 
515 520 525 



Gin Arg Val Pro Val Ala Ser Pro Ser Ala His Asn He Ser Ser Ser 
530 535 540 



Gly Gly Ala Pro Asp Arg Thr Asn Phe Pro Arg Gly Val Ser Ser Arg 
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560 



Ser Thr Plie His Ala Gly Gin lieu Arg Gin Val Arg Asp Gin Gin Asn 
565 570 575 

Leu Pro Tyr Gly Val Thr Pro Ala Ser Pro Ser Gly His Ser Gin Gly 
580 585 590 



Arg Arg Gly Ala Ser Gly Ser He Plie Ser Lys Plie Thr Ser Lys Phe 
595 600 605 

Val Arg Arg Asn Leu Asn Glu Pro Glu Ser Lys Asp Arg Val Glu Thr 
610 615 620 

Leu Arg Pro His Val Val Gly Ser Gly Gly Asn Asp Lys Glu Lys Glu 
625 ' 630 635 640 

Glu Phe Arg Glu Ala Lys Pro Arg Ser Leu Arg Phe Thr Trp Ser Met 
645 650 655 



Lys Thr Thr Ser Ser Met Glu Pro Asn Glu Met Met Arg Glu lie Arg 
660 665 670 



Lys Val Leu Asp Ala Asn Ser Cys Gin Ser Glu Leu His Glu Lys Tyr 
675 680 685 



Met Leu Leu Cys Met His Gly Thr Pro Gly His Glu Asp Phe Val Gin 
690 695 700 

Trp Glu Met Glu Val Cys Lys Leu Pro Arg Leu Ser Leu Asn Gly Val 
705 710 715 720 

Arg Phe Lys Arg He Ser Gly Thr Ser Met Ala Phe Lys Asn He Ala 
725 73 0 735 



Ser Lys He Ala Asn Glu Leu Lys Leu 
740 745 



<210> 25 

<211> 795 

<212> PRT 

<213> Homo sapiens 

<400> 25 

Met Ser Ala Arg Thr Pro Leu Pro Thr Val Asn Glu Arg Asp Thr Val 
15 10 15 
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Asn His Thr Thr Val Asp Gly Tyr Thr Glu Pro His lie Gin Pro Thr 
20 25 30 



Lys Ser Ser Ser Arg Gin Asn lie Pro Arg Cys Arg Asn Ser lie Thr 
35 40 45 



Ser Ala Thr Asp Glu Gin Pro His lie Gly Asn Tyr Arg Leu Gin Lys 
50 55 60 



Thr lie Gly Lys Gly Asn Phe Ala Lys Val Lys Leu Ala Arg His Val 
65 70 75 80 



Leu Thr Gly Arg Glu Val Ala Val Lys lie lie Asp Lys Thr Gin Leu 
85 90 95 



Asn Pro Thr Ser Leu Gin Lys Leu Phe Arg Glu Val Arg He Met Lys 
100 105 110 



Xle Leu Asn His Pro Asn He Val Lys Leu Phe Glu Val He Glu Thr 
115 120 125 



Glu Lys Thr Leu Tyr Leu Val Met Glu Tyr Ala Ser Gly Gly Glu Val 
130 135 140 



Phe Asp Tyr Leu Val Ala His Gly Arg Met Lys Glu Lys Glu Ala Arg 
145 150 155 160 



Ala Lys Phe Arg Gin lie Val Ser Ala Val Gin Tyr Cys His Gin Lys 
165 170 175 



Tyr He Val His Arg Asp Leu Lys Ala Glu Asn Leu Leu Leu Asp Gly 
180 185 190 



Asp Met Asn He Lys He Ala Asp Phe Gly Phe Ser Asn Glu Phe Thr 
195 200 205 



Val Gly Asn Lys Leu Asp Thr Phe Cys Gly Ser Pro Pro Tyr Ala Ala 
210 215 220 



Pro Glu Leu Phe Gin Gly Lys Lys Tyr Asp Gly Pro Glu Val Asp Val 
225 230 235 240 



Trp Ser Leu Gly Val He Leu Tyr Thr Leu Val Ser Gly Ser Leu Pro 
245 250 255 



Phe Asp Gly Gin Asn Leu Lys Glu Leu Arg Glu Arg Val Leu Arg Gly 
260 265 270 
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Lys Tyr Arg Xle Pro Phe Tyr Met Ser Thr Asp Cys Glu Asn Leu Leu 
275 280 285 



Lys Lys Leu Leu Val Leu Asn Pro He Lys Arg. Gly Ser Leu Glu Gin 
290 295 300 



He Met Lys Asp Arg; Trp Met Asn Val Gly His Glu Glu Glu Glu Leu 
305 310 315 320 

Lys Pro Tyr Thr Glu Pro Asp Pro Asp Phe Asn Asp Thr Lys Arg He 
325 330 335 



Asp Xle Met Val Thr Met Gly Phe Ala Arg Asp Glu He Asn Asp Ala 
340 345 350 



Leu He Asn Gin Lys Tyr Asp Glu Val Met Ala Thr Tyr He Leu Leu 
355 360 365 



Gly Arg Lys Pro Pro Glu Phe Glu Gly Gly Glu Ser Leu Ser Ser Gly 
370 375 380 



Asn Leu Cys Gin Arg Ser Arg Pro Ser Ser Asp Leu Asn Asn Ser Thr 
385 390 395 400 

Leu Gin Ser Pro Ala His Leu Lys Val Gin Arg Ser He Ser Ala Asn 
405 410 415 



Gin Lys Gin Arg Arg Phe Ser Asp His Ala Gly Pro Ser He Pro Pro 
420 425 430 



Ala Val Ser Tyr Thr Lys Arg Pro Gin Ala Asn Ser Val Glu Ser Glu 
435 440 445 



Gin Lys Glu Glu Trp Asp Lys Asp Val Ala Arg Lys Leu Gly Ser Thr 
450 455 460 



Thr Val Gly Ser Lys Ser Glu Met Thr Ala Ser Pro Leu Val Gly Pro 
465 470 475 480 



Glu Arg Lys Lys Ser Ser Thr He Pro Ser Asn Asn Val Tyr Ser Gly 
485 490 495 



Gly Ser Met Ala Arg Arg Asn Thr Tyr Val Cys Glu Arg Thr Thr Asp 
500 505 510 
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Arg Tyr Val Ala Leu Gin Asn Gly. Lys Asp Ser Ser Leu Thr Glu Met 
515 520 525 



Ser Val Ser Ser Xle Ser Ser Ala Gly Ser Ser Val Ala Ser Ala Val 
530 535 540 



Pro Ser Ala Arg- Pro Arg- His Gin Lys Ser Met Ser Thr Ser Gly His 
545 550 555 560 



Pro lie Lys Val Thr Leu Pro Thr lie Lys Asp Gly Ser Glu Ala Tyr 
565 570 575 



Arg Pro Gly Thr Thr Gin Arg Val Pro Ala Ala Ser Pro Ser Ala His 
580 585 v 590 



Ser lie Ser Thr Ala Thr Pro Asp Arg Thr Arg Phe Pro Arg Gly Ser 
595 600 605 



Ser Ser Arg Ser Thr Phe His Gly Glu Gin Leu Arg Glu Arg Arg Ser 
610 615 620 



Val Ala Tyr Asn Gly Pro Pro Ala Ser Pro Ser His Glu Thr Gly Ala 
625 630 635 640 



Phe Ala His Ala Arg Arg Gly Thr Ser Thr Gly lie lie Ser Lys lie 
645 650 655 



Thr Ser Lys Phe Val Arg Arg Asp Pro Ser Glu Gly Glu Ala Ser Gly 
660 665 670 



Arg Thr Asp Thr Ser Arg Ser Thr Ser Gly Glu Pro Lys Glu Arg Asp 
675 680 685 



Lys Glu Glu Gly Lys Asp Ser Lys Pro Arg Ser Leu Arg Phe Thr Trp 
690 695 700 



Ser Met Lys Thr Thr Ser Ser Met Asp Pro Asn Asp Met Met Arg Glu 
705 710 715 720 



He Arg Lys Val Leu Asp Ala Asn Asn Cys Asp Tyr Glu Gin Lys Glu 
725 730 735 



Arg Phe Leu Leu Phe Cys Val His Gly Asp Ala Arg Gin Asp Ser Leu 
740 745 750 



Val Gin Trp Glu Met Glu Val Cys Lys Leu Pro Arg Leu Ser Leu Asn 
755 760 765 
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Gly Val Arg Plie Lys Arg Xle Ser Gly Thr Ser lie Ala Phe Lys Asn. 
770 775 780 



He Ala Ser Lys lie Ala Asn Glu Leu Lys Leu 
785 790 795 



<210> 26 

<211> 729 

<212> PRT 

<213> Homo sapiens 

<400> 26 

Met Ser Thr Arg Thr Pro Leu Pro Thr Val Asn Glu Arg Asp Thr Glu 
15 10 15 

Asn His Thr Ser His Gly Asp Gly Arg Gin Glu Val Thr Ser Arg Thr 
20 25 30 



Ser Arg Ser Gly Ala Arg Cys Arg Asn Ser lie Ala Ser Cys Ala Asp 
35 40 45 



Glu Gin Pro His lie Gly Asn Tyr Arg Leu Leu Lys Thr lie Gly Lys 
50 55 60 



Gly Asn Phe Ala Lys Val Lys Leu Ala Arg His He Leu Thr Gly Arg 
65 70 75 80 

Glu Val Ala He Lys Xle He Asp Lys Thr Gin Leu Asn Pro Thr Ser 
85 90 95 



Leu Gin Lys Leu Phe Arg Glu Val Arg He Met Lys He Leu Asn His 
100 105 110 



Pro Asn He Val Lys Leu Phe Glu Val He Glu Thr Glu Lys Thr Leu 
115 120 12 5 



Tyr Leu He Met Glu Tyr Ala Ser Gly Gly Glu Val Phe Asp Tyr Leu 
130 135 140 

Val Ala His Gly Arg Met Lys Glu Lys Glu Ala Arg Ser Lys Phe Arg 
145 150 155 160 

Gin He Val Ser Ala Val Gin Tyr Cys His Gin Lys Arg He Val His 
165 170 175 



Arg Asp Leu Lys Ala Glu Asn Leu Leu Leu Asp Ala Asp Met Asn He 
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180 185 190 

Lys He Ala Asp Phe Gly Phe Ser Asn Glu Phe Thr Val Gly Gly Lys 
195 2 00 205 

Leu Asp Thr Phe Cys Gly Ser Pro Pro Tyr Ala Ala Pro Glu Leu Phe 
210 215 220 

Gin Gly Lys Lys Tyr Asp Gly Pro Glu Val Asp Val Trp Ser Leu Gly 
225 230 235 240 

Val lie Leu Tyr Thr Leu Val Ser Gly Ser Leu Pro Pile Asp Gly Gin 
245 250 255 

Asn Leu Lys Glu Leu Arg Glu Arg Val Leu Arg Gly Lys Tyr Arg He 
260 265 270 

Pro Phe Tyr Met Ser Thr Asp Cys Glu Asn Leu Leu Lys Arg Phe Leu 
275 280 285 

Val Leu Asn Pro He Lys Arg Gly Thr Leu Glu Gin lie Met Lys Asp 
290 295 300 

Arg Trp He Asn Ala Gly His Glu Glu Asp Glu Leu Lys Pro Plae Val 
305 310 315 320 

Glu Pro Glu Leu Asp He Ser Asp Gin Lys Arg He Asp He Met Val 
325 330 335 

Gly Met Gly Tyr Ser Gin Glu Glu He Gin Glu Ser Leu Ser Lys Met 
340 345 350 

Lys Tyr Asp Glu He Thr Ala Thr Tyr Leu Leu Leu Gly Arg Lys Ser 
355 360 365 

Ser Glu Leu Asp Ala Ser Asp Ser Ser Ser Ser Ser Asn Leu Ser Leu 
370 375 380 

Ala Lys Val Arg Pro Ser Ser Asp Leu Asn Asn Ser Thr Gly Gin Ser 
385 390 395 400 

Pro His His Lys Val Gin Arg Ser Val Ser Ser Ser Gin Lys Gin Arg 
405 410 415 

Arg Tyr Ser Asp His Ala Gly Pro Ala He Pro Ser Val Val Ala Tyr 
420 425 43 0 
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Pro Lys Arg Ser Gin Thr Ser Thr Ala Asp Gly Asp Leu Lys Glu Asp 
435 440 445 



Gly He Ser Ser Arg Lys Ser Ser Gly Ser Ala Val Gly Gly Lys Gly 
450 455 460 



He Ala Pro Ala Ser Pro Met Leu Gly Asn Ala Ser Asn Pro Asn Lys 
465 470 475 480 



Ala Asp lie Pro Glu Arg Lys Lys Ser Ser Thr Val Pro Ser Ser Asn 
485 490 495 



Thr Ala Ser Gly Gly Met Thr Arg Arg Asn Thr Tyr Val Cys Ser Glu 
500 505 510 



Arg Thr Thr Ala Asp Arg His Ser Val He Gin Asn Gly Lys Glu Asn 
515 520 525 



Ser Thr He Pro Asp Gin Arg Thr Pro Val Ala Ser Thr His Ser He 
530 535 540 



Ser Ser Ala Ala Thr Pro Asp Arg He Arg Phe Pro Arg Gly Thr Ala 
545 550 555 560 



Ser Arg Ser Thr Phe His Gly Gin Pro Arg Glu Arg Arg Thr Ala Thr 
565 570 575 



Tyr Asn Gly Pro Pro Ala Ser Pro Ser Leu Ser His Glu Ala Thr Pro 
580 585 590 



Leu Ser Gin Thr Arg Ser Arg Gly Ser Thr Asn Leu Phe Ser Lys Leu 
595 600 605 



Thr Ser Lys Leu Thr Arg Ser Arg Asn Val Ser Ala Glu Gin Lys Asp 
• 610 615 620 



Glu Asn Lys Glu Ala Lys Pro Arg Ser Leu Arg Phe Thr Trp Ser Met 
625 630 635 640 



Lys Thr Thr Ser Ser Met Asp Pro Gly Asp Met Met Arg Glu He Arg 
645 650 655 



Lys Val Leu Asp Ala Asn Asn Cys Asp Tyr Glu Gin Arg Glu Arg Phe 
660 665 670 



Leu Leu Phe Cys Val His Gly Asp Gly His Ala Glu Asn Leu Val Gin 
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675 680 685 

Trp Glu Met Glu Val Cys Lys Leu Pro Arg; Leu Ser Leu Asn Gly Val 
690 695 700 

Arg Plie Lys Arg lie Ser Gly Thr Ser lie Ala Phe Lys Asn lie Ala 
705 710 715 720 

Ser Lys He Ala Asn Glu Leu Lys Leu 
725 

<210> 27 

<211> 713 

<212> PRT 

<213> Homo sapiens 

<400> 27 

Met Ser Thr Arg Thr Pro Leu Pro Thr Val Asn Glu Arg Asp Thr Glu 
15 10 15 

Asn His Thr Ser His Gly Asp Gly Arg Gin Glu Val Thr Ser Arg Thr 
20 25 30 

Ser Arg Ser Gly Ala Arg Cys Arg Asn Ser He Ala Ser Cys Ala Asp 
35 40 45 

Glu Gin Pro His He Gly Asn Tyr Arg; Leu Leu Lys Thr He Gly Lys 
50 55 60 

Gly Asn Phe Ala Lys Val Lys Leu Ala Arg His He Leu Thr Gly Arg 
65 70 75 80 

Glu Val Ala He Lys He He Asp Lys Thr Gin Leu Asn Pro Thr Ser 
85 90 95 

Leu Gin Lys Leu Phe Arg Glu Val Arg; 'lie Met Lys He Leu Asn His 
100 105 110 

Pro Asn He Val Lys Leu Phe Glu Val He Glu Thr Gin Lys Thr Leu 
115 120 125 

Tyr Leu He Met Glu Tyr Ala Ser Gly Gly Lys Val Phe Asp Tyr Leu 
130 135 140 

Val Ala His Gly Arg Met Lys Glu Lys Glu Ala Arg Ser Lys Phe Arg 
145 150 155 160 
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Gin lie Val Ser Ala Val Gin Tyr Cys His Gin Lys Arg Xle Val His 
165 170 175 



Arg Asp Leu Lys Ala Glu Asn Leu Leu Leu Asp Ala Asp Met Asn lie 
180 185 190 



Lys He Ala Asp Phe Gly Phe Ser Asn Glu Phe Thr Val Gly Gly Lys 
195 200 205 



Leu Asp Thr Phe Cys Gly Ser Pro Pro Tyr Ala Ala Pro Glu Leu Phe 
210 215 220 



Gin Gly Lys Lys Tyr Asp Gly Pro Glu Val Asp Val Trp Ser Leu Gly 
225 230 235 240 



Val lie Leu Tyr Thr Leu Val Ser Gly Ser Leu Pro Phe Asp Gly Gin 
245 250 255 



Asn Leu Lys Glu Leu Arg Glu Arg Val- Leu Arg Gly Lys Tyr Arg Xle 
260 265 270 



Pro Phe Tyr Met Ser Thr Asp Cys Glu Asn Leu Leu Lys Arg Phe Leu 
275 280 285 



Val Leu Asn Pro He Lys Arg Gly Thr Leu Glu Gin He Met Lys Asp 
290 295 300 



Arg Trp lie Asn Ala Gly His Glu Glu Asp Glu Leu Lys Pro Phe Val 
305 310 315 320 



Glu Pro Glu Leu Asp Xle Ser Asp Gin Lys Arg He Asp He Met Val 
325 330 335 



Gly Met Gly Tyr Ser Gin Glu Glu He Gin Glu Ser Leu Ser Lys Met 
340 345 350 



Lys Tyr Asp Glu He Thr Ala Thr Tyr Leu Leu Leu Gly Arg Lys Ser 
355 360 365 



Ser Glu Val Arg Pro Ser Ser Asp Leu Asn Asn Ser Thr Gly Gin Ser 
370 375 380 



Pro His His Lys Val Gin Arg Ser Val Ser Ser Ser Gin Lys Gin Arg 
385 390 395 400 



Arg Tyr Ser Asp His Ala Gly Pro Gly He Pro Ser Val Val Ala Tyr 
405 410 415 



51 



_02098890A2J_> 



